FIGURE 1 
ID-65 

Clone 3-60 

5 

GTGTTTATGATGAAAAAAGGACAAGTAAATGATACTAAGCAA 
TCTTACTCTCTACGTAAATATAAATTTGGTTTAGCATCAGTAA 
TTTTAGGGTCATTCATAATGGTCACAAGTCCTGTTTTTGCGGA 
TCAAACTACATCGGTTCAAGTTAATAATCAGACAGGCACTAG 

1 0 TGTGGATGCTA ATAATTCTTCC AATG AG AC AAGTGCGTC A AGT 
GTGATTACTTCCAATAATGATAGTGTTCAAGCGTCTGATAAAG 
TTGTAAATAGTCAAAATACGGCAACAAAGGACATTACTACTC 
CTTTAGTAGAGACAAAGCCAATGGTGGAAAAAACATTACCTG 
AACAAGGGAATTATGTTTATAGCAAAGAAACCGAGGTGAAAA 

1 5 ATACACCTTCAAAATCAGCCCCAGTAGCTTTCTATGCAAAGAA 
AGGTGATAAAGTTTTCTATGACCAAGTATTTAATAAAGATAAT 
GTGAAATGGATTTCATATAAGTCTTTTGGTGGCGTACGTCGAT 
ACGCAGCTATTGAGTCACTAGATCCATCAGGAGGTTCAGAGA 
CTAAAGCACCTACTCCTGTAACAAATTCAGGAAGCAATAATC 

20 AAGAGAAAATAGCAACGCAAGGAAATTATACATTTTCACATA 
AAGTAGAAGTAAAAAATGAAGCTAAGGTAGCGAGTCCAACTC 
AATTTACATTGGACAAAGGAGACAGAATTTTTTACGACCAAA 
TACTAACTATTGAAGGAAATCAGTGGTTATCTTATAAATCATT 
CAATGGTGTTCGTCGTTTTGTTTTGCTAGGTAAAGCATCTTCA 

25 GTAGAAAAAACTGAAGATAAAGAAAAAGTGTCTCCTCAACCA 
CAAGCCCGTATTACTAAAACTGGTAGACTGACTATTTCTAACG 
AAACAACTACAGGTTTTGATATTTTAATTACGAATATTAAAGA 
TGATAACGGTATCGCTGCTGTTAAGGTACCGGTTTGGACTGAA 
CAAGGAGGGCAAGATGATATTAAATGGTATACAGCTGTAACT 

30 ACTGGGGATGGCAACTACAAAGTAGCTGTATCATTTGCTGAC 
CATAAGAATGAGAAGGGTCTTTATAATATTCATTTATACTACC 
AAGAAGCTAGTGGGACACTTGTAGGTGTAACAGGAACTAAAG 
TGACAGTAGCTGGAACTAATTCTTCTCAAGAACCTATTGAAAA 
TGGTTTACCAAAGACTGGTGTTTATAATATTATCGGAAGTACT 

35 GAAGTAAAAAATGAAGCTAAAATATCAAGTCAGACCCAATTT 
ACTTTAGAAAAAGGTGACAAAATAAATTATGATCAAGTATTG 
ACAGCAGATGGTTACCAGTGGATTTCTTACAAATCTTATAGTG 
GTGTTCGTCGCTATATTCCTGTGAAAAAGCTAACTACAAGTAG 
TGAAAAAGCGAAAGATGAGGCGACTAAACCGACTAGTTATCC 

40 CAACTTACCTAAAACAGGTACCTATACATTTACTAAAACTGTA 
GATGTGAAAAGTCAACCTAAAGTATCAAGTCCAGTGGAATTT 
AATTTTCAAAAGGGTGAAAAAATACATTATGATCAAGTGTTA 
GTAGTAGATGGTCATCAGTGGATTTCATACAAGAGTTATTCCG 
GTATTCGTCGCTATATTGAAATTTAA 
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MFMMKKGQVNDTPCQSYSLRKYKFGLASVILGSFIMVTSPVFADQTTSVQVNN 
QTGTSVDANNSSNETSASSVITSNNDSVQASDKVVNSQNTATKDITTPLVETK 
5 PMVEKTLPEQGNYVYSKETEVKNTPSKSAPVAFYAK.KGDKVFYDQVFNKDN 
VKWISYKSFGGVRRYAA1ESLDPSGGSETKAPTPVTNSGSNNQEKIATQGNYT 
FSHKVEVKNEAKVASPTQFTLDKGDRIFYDQILTIEGNQWLSYKSFNGVRR.FV 
LLGKASSVEKTEDKLEKVSPQPQARITKTGRLTISNETTTGFDTLITNIKDDNGTA 
AVKVPVWTEQGGQDDIKWYTAVTTGDGNYKVAVSFADHKJSiEKGLYNIHLY 
1 0 YQEASGTLVGVTGTKVT VAGTNSSQEP1ENGLPKTGVYNIIGSTEVKNEAKISS 
QTQFTLEKGDKlNYDQVLTADGYQWISYKSYSGVflRYIPVKXLTTSSEKAKDE 
ATKPTSYPNLPKTGTYTFTKTVDVKSQPKVSSPVEFNFQKGEKIHYDQVLVVD 
GHQWISYKSYSGIRRYIEI* 

15 

Sequence description 

A) Length: 1642 bp - 547 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 

20 Orf is preceded by a potential Shine- 

Dalgarno sequence. 

1D-66 

25 

Clone 3-5 

ATGATATTGAGACGTCGAACTATTGTTTTATGGCAACTGGGTATCGCCATT 
TCTCTCATTCTTAGTATTCTAGCCTTAAATCTTTATTTCCATAGTACTCCCTT 

30 GCAAACCAATGCAGCTTTACGGAACCTTGCTCCTTCATTAAACCATCTTTTT 
GGGACAGATGGTTTAGGTAGGGATATGTTTGTCAGAACGATTAAAGGACT 
TTATTTCTCTCTACAAGTCGGCTTATTAGGTGCCCTTATGGGGGTCATTCTG 
GCGACAGTTTTTGGAGTGCTTGCAGGTTTAGGAAATAGCATTATTGATAAA 
ATAATAGCATGGTTAGTTGATTTGTTTATTGGTATGCCTCATTTGATTTTTA 

35 TGATTCTCATTTCTTTTGTTGTTGGGAAAGGTGCTCAAGGGGTCATCATTGC 
AACGGCTGTTACACATTGGCCTTCTTTAGCAAGGCTTATCCGCAATGAAGT 
CTATCATCTAAAGAATAAAGAATTTGTCCAACTTTCTAAAAGTATGGGAAA 
AACGCCTTATTATATTGTGAGGCATCATATCCTGCCTTTGATTGCTTCTCAA 
ATTTTCATTGGTTTTATCCTCTTATTTCCACATGTCATCCTACATGAAGCAT 

40 CAATGACTTTCTTAGGATTTGGGCTCTCTGCCGAACAACCTTCGGTTGGTA 
TCATTCTGTCAGAGGCAGCTAAGCATATCTCTCTTGGAAATTGGTGGTTGG 
TTATCTTTCCAGGACTTTATCTTATTTTGGTTGTCAATGCATTTGATACTAT 
CGGAGAATCTTTAAAGAAACTCTTTTACCCTCAAACTGATCATTTTTAG 
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MILRRRTIVLWQLGIAISLILSILALNLYFHSTPLQTNAALRNLAPSLNHLFGTD 
GLGRDMFVRTIKGLYFSLQVGLLGALMGV1LATVFGVLAGLGNSIIDK1IAWL 
VDLFIGMPHLIFMIL1SFVVGKGAQGVHATAVTHWPSLARLIRNEVYHLKNKE 
5 FVQLSKSMGKTPYYIVRHHILPLrASQIFlGFILLFPHVILHEASMTFLGFGLSAE 
QPSVGULSEAAKHISLGNWWLVIFPGLYLILVVNAFDTIGESLKKLFYPQTDHF 



10 Sequence description 

A) Length: 822 bp - 274 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 

1 5 Orf is preceded by a potential Shine- 

Dalgarno sequence. 

ID-78 



Clone 3-5b 

ATGACAGAAACATTATTAAGCATTAAAGACCTCTCCATCACCTTCACTCAA 
TACGGAAGATTTTTAAAACCATTTCAATCAACACCGATACAAGCGCTGA 

25 ATTTAGAAATTAAAAAAGGTGAGTTATTAGCTATTATAGGTGCTAGTGGTT 
CGGGGAAGAGTTTATTAGCACATGCTATTATGGATATTCTTCCTAAAAATG 
CATCTGTAACAGGAGATATGATTTATCGTGGTCAATCACTAAATTCTAAAC 
GCATTAAACAGTTGCGAGGAAAAGATATTACGTTGATTCCACAATCAGTTA 
ATTATTTAGATCCATCTATGAAAGTCAAACATCAGGTGCGCTTAGGTATCT 

30 CAGAAAATTCAAAGGCTACTCAAGAAGGATTGTTTCAACAGTTTGGTTTAA 
AAGAAAGTGATGGTGACTTGGATCCTTTCCAACTTTCTGGCGGAATGCTCC 
GACGTGTTTTGTTTACAACGTGTATTAGTGATAAGGTTTCTTTGATTATTGC 
GGATGAGCCCACCCCTGGATTACATCCAGATGCTCTGCAAATGGTTTTAGA 
CCAACTACGCTCCTTTGCAGATAAAGGAATAAGCGTTATATTTATCACTCA 

3 5 TG AT ATTGT AGC AGCT AGTC AA ATTGCTG ATCGT ATT ACT ATTTTT A AAG A 
GGGAAAAGCTATTGAAACAGCTCCAGCTAGTTTCTTTAGCGGAAATGGAG 
AGCAGTTACAAACAGAATTTGCTAGAAGTTTATGGCGCTCTCTCCCACAGC 
AAGAATTTTTGAAAGGAGTTACTCATGACCTTAGAGGCTAA 

40 MTETLLSIKDLSITFTQYGRFLKPFQSTPIQALNLEIKKGELLAIIGASGSGKSLL 
AHAIMDILPKNASVTGDMIYRGQSLNSKRIKQLRGKDITLIPQSVNYLDPSMK 
VKHQVRLGISENSKATQEGLFQQFGLKESDGDLDPFQLSGGMLRRVLFTTCIS 
DKVSLIIADEPTPGLHPDALQMVLDQLRSFADKGISVIFITHDIVAASQIADRITI 
FKLEGKAIETAPASFFSGNGEQLQTEFARSLWRSLPQQEFLKGVTHDLRG* 
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Sequence description 

5 

A) Length: 804 bp - 268 aa (full length gene) 

B) Sequence Characteristics: 

No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
1 0 Dalgarno sequence. 

This gene was not isolated using the LEEP 
system. However in determining a full length 
gene sequence for ID-76, this gene was 
identified downstream and fully sequenced. 

15 

ID-79 



Clone 3-5c 

20 

GTCCATCTGGGGTGGTTCCCGATTGGTATTTCTTCTCCGATAGGTACTTTGA 
GTCAAGATATTACGTTAGCTGATCGTATTAAGCACCTTATTTTACCTGTTTT 
CACGGTAAGTATTCTAGGCATTGCCAATGTAACTCTTCATACTAGAACTAA 
AATGATGTCGGTACTTTCTAGTGAATATGTCTTATTTGCCAGAGCGCGTGG 

25 GGAAACGGAATGGCAAATTTTTAAAAATCATTGTCTTAGAAATGCTATCGT 
ACCAGCTATTACACTGCATTTTTCCTATTTTGGAGAATTGTTTGGAGGATCC 
GTTCTTGCTGAGCAAGTTTTCTCATATCCAGGACTAGGGTCTACCCTAACT 
GAAGCAGGACTTAAAAGTGATACACCGCTACTTCTAGCTATTGTGATGATA 
GGGACATTATTTGTTTTTGCGGGCAATCTTATTGCGGATATTTTAAATAGC 

30 ATAATCAATCCACAGTTAAGGAGAAAAGTATGA 

VHLGWFPIGISSPIGTLSQDITLADRIKHLILPVFTVSILGIANVTLHTRTKMMSV 
LSSEYVLFARARGETEWQIFKNHCLRNAIVPAITLHFSYFGELFGGSVLAEQVF 
SYPGLGSTLTEAGLKSDTPLLLAIVMIGTLFVFAGNLIADILNSirNPQLRRKV* 

35 

Sequence description 



A) Length: 495 bp - 165 aa (partial gene sequence) 

B) Sequence Characteristics: 

40 N-terminus has yet to be determined. 

This gene was not isolated using the LEEP 
system. However in determining a full length 
gene sequence for ID-76, this gene was 
identified upstream. 
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ID-80 

5 

Clone 2-17 

TTGCGGACAATTACGTTCAAACACAATGAAACGCGATCGTCAAAAAGCGA 
AGGTAGGGCGGTAATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGA 

1 0 C AA AGATTAGTCGTCGTTTCGTTTGG ATGTTAGTGGTTATCTATTGTCTTAT 
TATTGTCAGGATGTGTTTTGGGCCTCAAATTATGATTGAGGGGGTATCAAC 
TCCGAATGTTCAGCGCTTCGGAAGAATTGTAGCTCTTTTAGTACCATTTAA 
TTCTTTTCGTAGTTTAGATCAGCTAACTAGCTTTAAAGAGATTTTTTGGGTT 
ATTGGTCAAAATGTAGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGT 

1 5 TACTATCCCTAAAGCCAAGTTTACGGAAATATAAAAGCGTTATATTACTTG 
CTTTCTTGATGTCTCTTTTCATAGAGTGTACTCAAGTTGTTTTAGATATTTT 
AATAGATGCTAATCGGGTTTTTGAAATCGACGATCTATGGACAAATACCTT 
AGGCGGTCCTTTCGCCCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCT 
AACTATTAGAAAATGA 

20 

MRTITFKHNETRSSKSEGRAVMLICRLFTEDGELTKISRRPVWMLVVIYCLIIVR 
MCFGPQIMIEGVSTPNVQRFGRIVALLVPFNSFRSLDQLTSFKEIFWVIGQNVV 
NILLLFPLIIGLLSLKPSLRKYKSVILLAFLMSLFIECTQVVLDILIDANRVFEIDD 
LWTNTLGGPFALWSYRN1KGWLLTIRK* 

25 

Sequence description 

A) Length: 579 bp - 193 aa (full length gene) 
30 B) Sequence Characteristics: 

Possesses a potential leader peptide sequence 

No obvious Shine-Dalgarno, but the 'TTG' codon 

may not be the actual translation start point. 

A methionine (ATG) that occurs ~22 codons 
35 downstream of the 'TTG' is preceded by a 

potential Shine-Dalgarno sequence and may 

represent the actual start codon. 

ID 81 

40 



Clone 3-1 
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TTGAAAAATTTAAATCGTTATGTAGTTGCGGTTTCTGGAGTCGTTTTACATT 
TAATGCTAGGATCAACTTATGCTTGGAGTGTGTTTCGTAACCCAATTATCT 
CAGAGACTGGTTGGGATATTTCATCAGTTTCATTCGCTTTTAGTTTGGCTAT 
TTTTTGTCTAGGAATGTCTGCAGCTTTTATGGGACACTTAGTAGAGCGTTTT 
5 GGTCCTAGGATAATGGGAATGATTTCTGCTATTTTATATGGAGCAGGGAAT 
GTGTTAACAGGCTTAGCCATTGAAACTCAGCAGTTATGGTTACTGTATGTT 
GCATACGGTATTTTAGGAGGAATCGGACTTGGTTCAGGTTATATTACTCCA 
GTATCGACTATTATTAAATGGTTTCCTGATAGGAGGGGACTAGCAACAGG 
ATTCGCTATTATGGGATTTGGCTTTGCTTCTTTAGTAACAAGTCCGCTTGCA 

1 0 CAATCCTTACTGATTAGGATTGGTGTGGGTA AAACGTTTTATATTTTGGGA 
TTAGTATATTTTTTTGTCATGATGATTGCCTCACAATTTATTAAACAACCAC 
CTCAGGAAAAAATAACTATTTTGACTCACGATGGTAAAAAGAATGCTATG 
AATTCACAAATTATCACTGGATTAAAAGCAAACGTCGCTATAAAATCAAA 
AACCTTTTACATCATTTGGTTGACCTTGTTTATTAATATTTCGTGTGGCTTA 

1 5 GGTTTAATATCAGCAGCTTCACCAATGGCACAAGATTTAGCAGGCTATTCC 
GCAGAATCTGCAGCCTTATTAGTAGGGGTACTAGGGATATTTAACGGTTTT 
GGACGTCTGTTATGGGCAAGTCTCTCTGACTACATTGGACGCCCGTTGACC 
TTTATAATATTATTTATTGTGAACTTTATTATGACTTCTAGTTTATTTTTGTC 
ATTCAATGCTATTGTATTTGCAATAGCGATGTCTATTTTAATGACTTGTTAT 

20 GGTGCAGGTTTTTCCTTATTACCTGCTTATCTAAGTGATATTTTTGGAACAA 
AGGAATTAGCTACTTTACATGGTTATAGTTTAACAGCATGGGCAATAGCAG 
GTCTGTTTGGGCCCCTATTGTTATCAAAGACATATTCATGGGGAAATTCCT 
ATCAATTGACATTAATGGTTTTTGGTTTTTTATTCTTATTCGGATTATTGTTA 
TCTCTATATTTAAGAAAATTAACAACTAAAGTTGTGTAG 

25 

LKNLNRYVVAVSGVVLHLMLGSTYAWSVFRNPIISETGWDISSVSFAFSLAIFC 
LGMSAAFMGHLVERFGPRIMGMISAILYGAGNVLTGLAIETQQLWLLYVAYG 
ILGGIGLGSGYITPVSTIIKWFPDRRGLATGFATMGFGFASLVTSPLAQSLLIRIG 
VGKTFYILGLVYFFVMMIASQFIKQPPQEKITILTHDGKKNAMNSQIITGLKAN 
30 VAIKSKTFYI1WLTLF1NISCGLGLISAASPMAQDLAGYSAESAALLVGVLGIFN 
GFGRLLWASLSDYIGRPLTF1ILFIVNFIMTSSLFLSFNAIVFA1AMSILMTCYGA 
GFSLLPAYLSDIFGTKELATLHGYSLTAWAIAGLFGPLLLSKTYSWGNSYQLTL 
M VFGFLFLFGLLLS L YLRKLTTKV V * 

35 

Sequence description: 

A] Length 1221 bp - 407 a.a (full length 
gene). 

40 B] TTG start codon with Shine-Dalgarno 

sequence upstream. Obvious signal peptide, 
with hydropathy plot exhibiting many possible 
membrane spanning regions, indicating protein 
to be transmembrane. 
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Clone 48 

1 0 ATGGCAGATAAAAACAGAACATTTAAACTTGTAGGTGCAGGATCTTCTAG 
CACACAAGAAAAAATTGAAAAGCCTGCTCTTTCGTTTATGCAAGATGCGTG 
GCGTCGCTTGAAAAAAAACAAATTAGCAGTAGTTTCACTCTATTTATTAGC 
TCTTTTACTTACTTTTTCGTTAGCCTCAAATTTATTTGTAACTCAGAAGGAT 
GCTAATGGGTTTGATTCGAAAAAAGTAACGACATATCGCAACTTACCACCT 

1 5 AAATTGAGTTCAAACCTTCCTTTTTGGAATGGTAGCATTAATCCATCA 

MADKNRTFKLVGAGSSSTQEKIEKPALSFMQDAWRRLKKNKLAVVSLYLLA 
LLLTFSLASNLFVTQKDANGFDSKKVTTYRNLPPKLSSNLPFWNGSINPS 

20 

Sequence description: 

A] Current length is 303 bp - 101 aa 

B] No obvious signal peptide but Shine 

25 Dalgarno sequence upstream of the ATG start 

codon. Not ide3ntified directly using the LEEP system but was found 
directly downstream of ID-34 described in WO 00/06736. 



30 ID-83 



Clone 98 

35 

ATGAAAATAGTAGTACCAGTAATGCCTCGCAGTCTTGAAGAGGCTCAAGA 
AATAGATTTATCAAAATTTGATAGTGTTGATATTATTGAATGGCGAGCTGA 
TGCCTTACCAAAGGATGACATTATTAATGTAGCTCCAGCTATTTTTGAGAA 
ATTCGCAGGTCATGAAATTATTTTTACTTTTCGTACAACGCGTGAAGGTGG 
40 TAATATTGTCTTATCTGATGCTGAGTATGTTGAGTTAATCCAGAAAATTAA 
TTCTATCTACAATCCAGATTATATTGATTTTGAGTATTTTTCACATAAAGAA 
GTTTTTCAAGAAATGCTAGAATTTCCAAATTTAGTCCTGTCTTATCACAATT 
TTCAAGAGACACCGGAGAATATTATGGAGATATTTTCAGAATTAACAGCC 
CTAGCACCACGAGTTGTGAAAATCGCAGTAATGCCAAAGAATGAACAAGA 
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TGTCTTAGACGTTATGAATTACACTCGCGGTTTCAAGACTATTAATCCTGA 
TCAAGTTTATGCGACGGTATCTATGAGTAAAATTGGACGTATTTCTCGTTTT 
GCTGGTGATGTAACTGGATCTAGTTGGACATTTGCATATTTAGATTCATCT 
ATCGCACCCGGACAAATTACTATTTCAGAGATGAAGCGTGTCAAAGCATT 
5 GCTTGACGCTGACTGA 

MK1VVPVMPRSLEEAQEIDLSKFDSVDIIEWRADALPKDDIINVAPAIFEKPAG 
HETIFTFRTTREGGNIVLSDAEYVELIQKINSIYNPDYIDFEYFSHKEVFQEMLEF 
PNLVLSYHNFQETPENIMEIFSELTALAPRVVKIAVMPKNEQDVLDVMNYTRG 
10 FKT1NPDQVYATVSMSKIGRISRFAGDVTGSSWTFAYLDSSIAPGQITISEMKRV 
KALLDAD* 



Sequence description: 

15 

A] Length 678 bp, 225 aa (full length gene) 

B] No obvious signal peptide, but there is a 
Shine Dalgarno immediately upstream of ORF. 

20 

ID-84 



Clone RS-52 

25 

ATGAAAGACTTATTTGCAACAACAGAAGCATCATCAAGGAAACAGGAACA 
AGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAACAAATGGTA 
ATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAATGAGATGACA 
30 GGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCGATTAGTTTT 
TCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATATCAACCGAA 
TGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCAAAATGTTAATGA 
TATTGAAGTGATTTATATGAAGAAAGAATAG 

35 MKDLFATTEASSRKQEQDRIVNYIKQHVELTNGNQIKKIEF1DFQKNEMTGTW 
GISTK1NEQFSISFSEDR1GGKLRALGYQPNEIGFSKDINSNNQNVNDIEVIYMK 
KE* 

Sequence description: 

40 

A] length: 333 bp - 11 1 aa (partial sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the ATG start codon, and no obvious signal 
peptide within the protein. 
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ID-85 



Clone RS-53 



1 0 ATGAAAAAACGTATATGGTATTTGATA ATAATAATCAC AGTAATTTTAGG A 
GGACTAGCCATGAAAAACTTATTTGCAACAACAGAAGCATCATCAAGGAA 
ACAGGAACAAGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAA 
CAAATGGTAATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAAT 
GAGATGACAGGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCG 

1 5 ATTAGTTTTTCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATAT 
CAACCGAATGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCA 

MKKRIWYLIIIITVILGGLAMKNLFATTEASSRKQEQDRIVMY1KQHVELTNGN 
QIKKIEFLDFQKNEMTGTWGISTKINEQFSISFSEDRIGGKLRALGYQPNEIGFSK 
20 DTNSNNQ 



Sequence description: 

25 A] Length: 351 bp - 1 17 aa (Partial sequence) 

B] Obvious signal peptide and Shine Dalgarno 
sequence upstream of the ATG start codon. 



30 

ID-86 

Clone rD-74 

3 5 ATGTC AAATC AATATGATTATATCGTTATTGGTGG AGGTAGT 
GCAGGCAGTGGTACCGCTAATAGGGCAGCCATGTATGGAGC 
AAAAGTCCTGTTAATTGAAGGTGGACAAGTAGGTGGAACTTG 
TGTTAACTTAGGTTGTGTACCTAAGAAAATCATGTGGTATGG 
TGCACAAGTTTCTGAGACACTCCATAAGTATAGTTCAGGTTA 

40 TGGTTTTGAAGCCAATAATCTTAGTTTTGATTTTACTACTCTA 
AAAGCTAATCGCGATGCTTACGTGCAGCGGTCTAGACAGTCG 
TATGCCGCTAATTTTGAGCGTAATGGGGTCGAAAAGATTGAT 
GGATTTGCTCGTTTTATTGATAACCATACTATTGAAGTGAATG 
GTCAGCAATATAAAGCTCCTCACATTACTATTGCAACAGGTG 
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GACACCCTCTTTACCCTGATATTATTGGAAGTGAACTTGGTG 
AGACTTCTGATGATTTTTTTGGATGGGAGACCTTACCAAATTC 
TATATTGATTGTTGGGGCGGGCTATATCGCGGCAGAACTTGC 
TGGAGTGGTTAATGAATTAGGCGTTGAAACCCATCTTGCATT 
5 TAGAAAAGACCATATTCTACGCGGATTTGATGACATGGTAAC 
AAGTGAGGTTATGGCTGAAATGGAGAAATCAGGTATCTCTTT 
ACATGCTAACCATGTACCTAAATCTCTTAAACGCGATGAAGG 
TGGCAAGTTGATTTTTGAAGCTGAAAATGGGAAAACGCTTGT 
CGTTGATCGTGTAATATGGGCTATCGGCCGTGGACCAAATGT 

1 0 AGACATGGGACTTGAAAATACCGATATTGTTTTAAATGATAA 
AGATTATATCAAAACAGATGAATTTGAGAATACTTCTGTAGA 
TGGCGTGTATGCTATTGGAGATGTTAATGGGAAAATTGCCTT 
GACACCGGTAGCAATTGCAGCAGGTCGTCGCTTATCAGAAAG 
ACTTTTTAATCATAAAGATAACGAAAAATTAGATTACCATAA 

1 5 TGTACCTTCAGTTATTTTTACTCACCCTGTAATTGGGACGGTA 
GGACTTTCAGAAGCAGCAGCTATCGAGCAATTTGGAAAAGAT 
AATATCAAAGTCTATACATCAACTTTTACCTCTATGTATACGG 
CTGTTACCAGTAATCGCCAAGCAGTTAAGATGAAGCTCATAA 
CCCTAGGAAAAGAGGAAAAAGTTATTGGGCTTCATGGTGTTG 

20 GTTATGGTATTGATGAAATGATTCAAGGTTTTTCAGTTGCTAT 
CAAAATGGGGGCTACTAAAGCAGACTTTGATGATACTGTTGC 
TATTCACCCAACTGGATCTGAGGAATTTGTTACAATGCGCTA 
A 

25 MSNQYDYIVIGGGSAGSGTANRAAMYGAKVLLIEGGQVGGTC 
VNLGCVPKKIMWYGAQVSETLHKYSSGYGFEANNLSFDFTTLK 
ANRDAYVQRSRQSYAANFERNGVEKIDGFARFIDNHTIEVNGQ 
QYKAPHITIATGGHPLYPDIIGSELGETSDDFFGWETLPNSILTVG 
AGYIAAELAGVVNELGVETHLAFRKDHILRGFDDMVTSEVMAE 

30 MEKSGISLHANHVPKSLKRDEGGKLIFEAENGKTLVVDRVIWAI 
GRGPNVDMGLENTDIVLNDKDYIKTDEFENTSVDGVYAIGDVN 
GKIALTPVAIAAGRRLSERLFNH1CDNEKLDYHNVPSV1FTHPV1G 
TVGLSEAAA1EQFGKDNIKVYTSTFTSMYTAVTSNRQAVKMKLI 
TLGKEEKVIGLHGVGYGIDEMIQGFSVAIKMGATKADFDDTVAI 

35 HPTGSEEFVTMR* 



40 



ID-87 



Clone RS-55 



ATGACAAAAAAACATCTTAAAACGCTTGCCTTGGCACTTACTACAGTATCA 
5 GTAGTGACATACAGCCAGGAGGTATATGGATTAGAAAGAGAGGAATCGGT 
CAAACAAGAACAAACCCAGTCAGCTTCAGAAGATGATTGGTTCGAAGAAG 
ATAATGAGAGGAAAACAAATGTTTCTAAAGAGAATTCTACTGTTGATGAA 
ACAGTTAGTGATTTATTTTCTGATGGAAATAGTAATAACTCTAGTTCTAAA 
ACCGAGTCAGTGGTAAGTGACCCTAAACAAGTCCCCAAAGCAAAACCAGA 

1 0 GGTTACAC AAGAAGCAAGCA ATTCTAGTAATG ATGCTAGC AAAGTAG A AG 
TACCAAAACAGGATACAGCTTCAAAAAAGGAAACTCTAGAAACATCAACT 
TGGGAGGCAAAAGATTTCGTAACTAGAGGGGATACTTTAGTAGGTTTTTCA 
AAATCTGGAATTAATAAGTTATCTCAAACATCACACTTGGTTTTACCAAGT 
CATGCAGCAGATGGAACTCAATTGACACAAGTAGCTAGCTTTGCTTTTACT 

1 5 CCAGATAAAAAG ACGGCCATTGCAGAATATACAAGTAGGCTAGGAGAAA 
ATGGGAAACCGAGTCGTTTAGATATTGATCAGAAGGAAATTATTGATGAG 
GGAGAAATATTTAATGCTTACCAGTTGACTAAGCTTACTATTCCAAATGGT 
TATAAGTCTATTGGTCAAGATGCTTTTGTGGACAATAAGAATATTGCTGAG 
GTTAACCTTCCTGAGAGTCTCGAGACTATTTCAGACTATGCTTTTGCTCACA 

20 TGTCTTTAAAACAAGTAAAGTTACCAGATAACCTAAAGGTCATTGGAGAA 
TTAGCTTTTTTTGATAATCAGATTGGTGGTAAGCTTTACTTGCCACGTCACT 
TGATAAAATTAGCAGAACGCGCTTTCAAATCTAATCGTATTCAAACAGTTG 
AATTTTTGGGAAGTAAGCTTAAGGTTATAGGAGAAGCAAGTTTTCAAGAT 
AATAATCTGAGGAATGTTATGCTTCCGGATGGACTTGAAAAAATAGAATC 

2 5 AG AAGCTTTT AC AGG AA ATCC AGG AG ATG AAC ATT AC AAC A ATC AGGTTG 

TATTGCGCACAAGGACAGGCCAAAATCCACATCAACTTGCGACTGAGAAT 
ACTTACGTCAATCCGGACAAATCATTGTGGCGTGCAACACCTGATATGGAT 
TATACCAAATGGTTAGAGGAAGATTTTACCTATCAAAAAAATAGTGTTACA 
GGTTTTTCAAATAAAGGCTTACAAAAGGTAAGACGTAATAAAAACTTAGA 
30 AATTCCAAAACAACACAATGGTATTACTATTACTGAAATTGGTGATAACGC 
TTTTCGCAATGTTGATTTTCAAAGTAAAACTTTACGTAAATATGATTTGGA 
AGAAATAAAGCTCCCCTCAACTATTCGGAAAATAGGTGCTTTTGCTTTTCA 
ATCTAATAACTTGAAATCCTTTGAAGCAAGTGAAGATTTAGAAGAGATTA 
AAGAGGGAGCCTTTATGAATAATCGTATTGGAACTCTAGACTTGAAAGAC 

3 5 AAACTT ATC AAAAT AGGTGATGCTGCTTTCC ATATTA ATC ATATTTATGCC 

ATTGTTCTTCCAGAATCTGTACAAGAAATAGGACGTTCAGCTTTTCGACAA 
AATGGTGCGCTTCACCTTATGTTTATCGGAAATAAGGTTAAAACAATTGGT 
GAAATGGCTTTTTTATCCAATAAACTGGAAAGTGTAAATCTCTCTGAGCAA 
AAACAATTAAAGACAATTGAGGTCCAAGCTTTTTCGGATAATGCCCTTAGT 
40 GAAGTAGTCTTACCGCCAAATTTACAGACTATTCGTGAAGAGGCTTTCAAA 
AGGAATCATTTGAAAGAAGTGAAGGGTTCATCTACATTATCTCAGATTACT 
TTTAATGCTTTTGATCAAAATGATGGGGACAAACGCTTTGGTAAGAAAGTG 
GTTGTTAGGACACATAATAATTCTCATATGTTAGCAGATGGTGAGCGTTTT 
ATCATTGATCCAGATAAGCTATCTTCTACAATGGTAGACCTTGAAAAGGTT 
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TTAAAAATAATCGAAGGTTTAGATTACTCTACATTACGTCAGACTACTCAA 
ACTCAGTTTAGAGAAATGACTACTGCAGGTAAAGCGTTGTTATCAAAATCT 
AACCTCCGACAAGGAGAAAAACAAAAATTCCTTCAAGAAGCACAATTTTT 
CCTTGGTCGCGTTGATTTGGATAAAGCCATAGCTAAAGCTGAGAAGGCTTT 
5 AGTGACCAAGAAGGCAACAAAGAATGGTCATTTGCTTGAGAGGAGTATTA 
ACAAAGCGGTATTAGCTTATAATAATAGTGCTATTAAAAAAGCTAATGTTA 
AGCGCTTGGAAAAAGAGTTAGACTTGCTGACAGATTTAGTCGAGGGAAAA 
GGACCATTAGCGCAAGCTACAATGGTACAAGGAGTTTATTTATTAAAGAC 
GCCTTTACCATTGCCAGAATATTATATCGGATTGAACGTTTATTTTGACAA 

1 0 GTCTGGAAAATTG ATTTATGCACTTGATATGAGTGATACTATTGGCGAGGG 
ACAAAAAGATGCATATGGTAATCCTATATTAAATGTTGACGAGGATAATG 
AAGGTTATCATACCTTGGCAGTTGCCACTTTAGCTGATTATGAAGGTCTTT 
ATATTAAAGATATTTTAAATAGTTCCCTTGATAAGATTAAAGCAATACGCC 
AGATTCCTTTGGCAAAATATCATAGATTAGGAATTTTCCAAGCTATCCGAA 

1 5 ATGCAGCGGCAGAAGCAGACCGATTGCTTCCTAAGACACCTAAGGGGTAC 
CTAAATGAAGTCCCAAATTATCGTAAAAAACAAATGGAGAAAAATTTAAA 
ACCAGTTGATTATAAAACGCCGATTTTTAATAAGGCTTTACCTAATGAAAA 
GGTAGACGGTGATAGAGCGGCTAAAGGTCATAATATAAATGCGGAGACTA 
ATAATTCTGTAGCTGTAACACCAATAAGGTCCGAGCAGCAATTACATAAGT 

20 CACAGTCTGATGTAAATTTACCTCAAACAAGTTCTAAAAATAATTTTATAT 
ACGAGATTCTAGGATACGTTAGTTTATGTTTGCTTTTCCTAGTAACTGCTGG 
GAAAAAAGGAAAACGAGCAAGAAAATAA 

MTKKHLKTLALALTTVSVVTYSQEVYGLEREESVKQEQTQSASEDDWFEEDN 

25 ERKTNVSKENSTVDETVSDLFSDGNSNNSSSKTESVVSDPKQVPKAKPEVTQE 
ASNSSNDASKVEVPKQDTASKKETLETSTWEAKDFVTRGDTLVGFSKSGINKL 
SQTSHLVLPSHAADGTQLTQVASFAFTPDKKTAIAEYTSRLGENGKPSRLDIDQ 
KEIIDEGEIFNAYQLTKLTTPNGYKSIGQDAFVDNKNIAEVNLPESLETrSDYAF 
AHMSLKQVKLPDNLKVIGELAFFDNQIGGKLYLPRHLIKLAERAFKSNRIQTV 

30 EFLGSKLKVIGEASFQDNNLRNVMLPDGLEKJESEAFTGNPGDEHYNNQVVLR 
TRTGQNPHQLATENTYVNPDKSLWRATPDMDYTKWLEEDFTYQKNSVTGFS 
NKGLQKVRRNKNLEIPKQHNGITITETGDNAFRNVDFQSKTLRKYDLEEIKLPS 
TIRKIGAFAFQSNNLKSFEASEDLEEIKEGAFMTvrNRIGTLDLKDKLIKIGDAAFH 
INHIYAIVLPESVQEIGRSAFRQNGALHLMFIGNKVKTIGEMAFLSNKLESVNL 

35 SEQKQLKTIEVQAFSDNALSEVVLPPNLQTIREEAFKRNHLKEVKGSSTLSQITF 
NAFDQNDGDKRFGKXVVVRTHNNSHMLADGERF1IDPDKLSSTMVDLEKVL 
KIIEGLDYSTLRQTTQTQFREMTTAGKALLSKSNLRQGEKQKFLQEAQFFLGR 
VDLDKAIAKAEKALVTKKATKNGHLLERSINKAVLAYNNSAIKKANVKRLEK 
ELDLLTDLVEGKGPLAQATMVQGVYLLKTPLPLPEYYIGLNVYFDKSGKLIYA 

40 LDMSDTIGEGQKDAYGNPILNVDEDNEGYHTLAVATLADYEGLYIKDILNSSL 
DKIKAIRQIPLAKYHRLG1FQAIRNAAAEADRLLPKTPKGYLNEVPNYRKKQM 
EKNLKPVDYKTPIFNKALPNEKVDGDRAAKGHNINAETNNSVAVTPIRSEQQL 
HKSQSDVNLPQTSSKNNFIYEILGYVSLCLLFLVTAGKKGKRARK* 
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Sequence description: 

5 A] Length 3168 bp - 1056 aa (Partial sequence) 

B] Obvious signal peptide with Shine Dalgarno 
sequence upstream of the ATG start codon. 



10 

ID-88 



Clone RS-56 

15 

GCAGGATACATCATGCACAAGCACGAGGCTATCGTGTCATGCTGGGGTCA 
ACCCAGGAAGACATGTCGGCACAAGCTGAAGATTTCTTTACAGTCTGTACA 
CAATAAAGAGACGGGTAAGAGCGCTTTTAATGACAAAGAACGACTAGCAA 
20 TT 

AGY1MHKHEA1VSCWGQPRKTCRHKLK1SLQSVHNKETGKSAFNDKLERLAI 



25 Sequence description: 

A] Length: 153 bp - 51 aa (partial sequence) 

B] No signal peptide visible, insufficient 
sequence data to determine the presence of a 

30 Shine Dalgarno sequence. 



ID-89 

35 

Clone RS-58 

GTGTCATTTATGCAAAGAAAATCCTATTTAAAATCCATGAGTGTTCTTACT 
40 TTAACAGCTTGTCTTATATCAGGATATGTGGTTAAAGATATTGCTATGTTA 
CATGCAGTATCTGCCAGTGAGAAGAAAGCAAATAATGTCAGTCCGAGAGA 
AAATCTCTACAGGGCTGTCAATGATAATTGGCTAGCCAATACAAAACTCA 
AACAAGGGCAGACTAGTGTTAATAGTTTTTCAGAAATTGAGGATAAATTA 
AAGCAACTGTTAGTGTCTGATATGGCTAAAATGGCCTCAGGAAAGATTGA 
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AACAACCAATGATGAACAGAAAAAAATGGTTGCATACTATAAACAAGGTA 

TGGACTTTAAAACAAGAGATAAAAATGGTCTCAAACCTCTAAAACCAGTT 

TTACAAAAACTTGAAGCAGTCTCTTCAATGAAAGACTTTCAAAGTTTGGCC 

CATGATTTTGTGATGAGTGGTTTTGTTTTACCATTTGGTTTGACTGTGGAAA 

CCAATGCTCGAGATAATAGCCAAAAGCAATTGGTGCTTCGTCAAGCACCC 

GCATTACTTGAATCACCTGACCAATATAAGAAGGGCAATAAAGAAGGTGA 

GGCTAAATTATCAGCTTACCGTACTTCAGCAATGGCTTTGCTTAAACAAGC 

TGGAAAAAGTAACATTGAAGATAGAAAACTAGTTAAACAAGCTATAGCAT 

TTGATAGACTCTTATCAGAAAAAACGCAAGTTGATCAAAGTAAAATCACA 

GCTGAAAGTGAGACAGCTGCGGGGCGATATAACCCTGAAAGTATGGAAAC 

GGTTCACAATTACGCCAAGGAATTTGACTTTAAAGAATTGATTGAAAAACT 

AGTTGGGCCAACGAATAAGGCAGTCAATGTAGAAGATAAAACTTATTTTA 

AACAGGTTAATGATGTTATAAATAGTAAACAATTAGCCAATATGAAAGCA 

TGGATGATGATTTCTATGCTAGTTGATCAATCAGATTTTCTAGGAGAACAA 

AATCGTCAAGCAGCGAGTGCTTTTAAGAATGTTGCGTCTGGTTTGACTCAG 

ATTGAATCGAAAGAAAAAATGCTTACACCCAATTAG 



MSFMQRKSYLKSMSVLTLTACLISGYVVKDIAMLHAVSASEKKANNVSPREN 

LYRAVNDNWLANTKLKQGQTSVNSFSEIEDKLKQLLVSDMAKMASGKIETTN 

DEQKKMVAYYKQGMDFKTRDKNGLFCPLKPVLQKLEAVSSMKDFQSLAHDF 

VMSGFVLPFGLTVETNARDNSQKQLVLRQAPALLESPDQYKKGNKEGEAKLS 

AYRTSAMALLKQAGKSNIEDRKLVKQAIAFDRLLSEKTQVDQSKITAESETAA 

GRYNPESMETVHNYAKEFDFKELIEKLVGPTNKAVNVEDKTYFKQVNDVINS 

KQLANMK^WMMISMLVDQSDFLGEQNRQAASAFKNVASGLTQIESKEKMLT 

PN* 



Sequence description: 



A] Length: 1095 bp - 365 aa (full length gene) 

B] an GTG (possible ATG start codon located 7 bp 
further downstream) start codon with an obvious 
signal peptide. Shine Dalgarno sequence present 
upstream of the ORF. 



1D-90 



Clone RS-59 



* .j ->, f~ ;.- ijf jp^?' if~i" s^s I^? 1 
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ATGGAAATGCCTAAAAGAAATGAATTACTCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 
GTTATTTAAATATTTTTTATAG 

MEMPKRNELLNKEIKMSIDKLRYKJEPESEHDICRPTFYLVVLILVTVAVILSLFK 
YFL* 



10 Sequence description: 

A] Length: 174 bp - 58 aa(full length gene) 

B] No obvious signal peptide, but Shine 
Dalgarno sequence is present upstream of ATG 

1 5 start codon. 



20 ID-91 



Clone RS-62 (partial sequence) 

25 

ATGCAGGTATTTTTAAATATTGTCAATAAATTCTTTGATCCAGTTATTCATA 
TGGGTTCGGGAGTTGTGATGCTAATTGTCATGACAGGTTTAGCCATGATAT 
TTGGAGTGAAGTTTTCTAAAGCACTTGAAGGTGGTAT 

30 MQVFLNIVNKFFDPVIHMGSGVVMLIVMTGLAMIFGVKFSKALEGG 



Sequence description: 

35 A] Length: 141 bp - 41 aa (partial sequence 

B] Shine Dalgarno sequence present upstream of 
ATG start codon with a possible signal peptide 
present 



ID-92 
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Clone RS-69 (partial sequence) 



ATGAAAAAGAAAACATTCAGTGCTTATAACTTTTTAACGGCTCTTATCCTT 
5 TGTCTTTTGACAGTGCTTTTTATCTTTCCATTTTATTGGATTATGACAGGAG 
CTTTTAA 

MKKKTFSAYNFLTALILCLLTVLF1FPFYW1MTGAF 

10 

Sequence description: 

A] Length: 1 10 bp -36 aa (Partial sequence) 

B] Possible signal peptide with Shine Dalgarno 
1 5 sequence directly upstream of the ATG start 

codon. 



20 

ID-93 



Clone RS-70 

25 

ATGACTGAGAACTGGTTACATACTAAAGATGGTTCAGATATTTATTATCGT 

GTCGTTGGTCAAGGTCAACCGATTGTTTTTTTACATGGCAATAGCTTAAGT 

AGTCGCTATTTTGATAAGCAAATAGCATATTTTTCTAAGTATTACCAAGTT 

30 ATTGTTATGGATAGTAGAGGGCATGGCAAAAGTCATGCAAAGCTAAATAC 
CATTAGTTTCAGGCAAATAGCAGTTGACTTAAAGGATATCTTAGTTCATTT 
AGAGATTGATAAAGTTATATTGGTAGGCCATAGCGATGGTGCTAATTTAGC 
TTTAGTTTTTCAAACGATGTTTCCAGATATGGTTAGAGGGCTTTTGCTTAAT 
TCAGGGAACCTGACTATTCATGGTCAGCGATGGTGGGATATTCTTTTAGTA 

3 5 AGGATTGCCT AT AAATTCCTTC ACTATTTAGGG AAACTCTTTCCGTAT ATG 
AGGCAAAAAGCTCAAGTTATTTCGCTTATGTTGGAGGATTTGAAGATTAGT 
CCAGCTGATTTACAGCATGTGTCAACTCCTGTAATGGTTTTGGTTGGAAAT 
AAGGACATAATTAAGTTAAATCATTCTAAGAAACTTGCTTCTTATTTTCCA 
AGGGGGGAGTTTTATTCTTTAGTTGGCTTTGGGCATCACATTATTAAGCAA 

40 GATTCCCATGTTTTTAATATTATTGCAAAAAAGTTTATCAACGATACGTTG 
AAAGGAGAAATTGTTGAAAAAGCTAATTGA 



MTENWLHTKDGSDIYYRVVGQGQPIVFLHGNSLSSRYFDKQ1AYFSKYYQVIV 
MDSRGHGKSHAKLNTISFRQIAVDLKDILVHLEIDKVILVGHSDGANLALVFQ 
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TMFPDMVRGLLLNSGNLTIHGQRWWD1LLVR1AYKFLHYLGKLFPYMRQKA 

QVISLMLEDLKISPADLQHVSTPVMVLVGNKDIIKLNHSKKLASYFPRGEFYSL 

VGFGHHI1KQDSHVFN11AICKFINDTLKGEIVEKAN* 



Sequence description: 

A] Length: 744 bp - 248 aa (full length gene) 

B] No obvious signal peptide, but Shine 
Dalgarno sequence upstream of the ATG start 
codon. 



ID-94 



Clone RS-71 



ATGGTAGCAAAAGAGTTAGGTAAAAATAGCTTTACTATCCCAACTATTTGT 

TCTAATTGCTCCGCAGGTACTGCCATTGCAGTTGTATATAATGATGACCAT 

TCTTTCTTAAGATACGGCTATCCCGAGTCTCCACTTCATATTTTTATCAATA 

CACGGATCATTGCACAGGCACCAAGCAAATATTTTTGGGCTGGTATTGGGG 

ACGGTATTTCAAAAGCCCCTGAAGTAGAACGTGCTACCTTAGAGGCTAAG 

ACCAATAAACTACCACATACTGCAGTGTTAGGACAAGCAGTCGCTCTGTCT 

TCAAAGGAAGCTTTTTATCAATTTGGTGAACAAGGTCTAAAAGACGTTGAA 

GCTAATTTAGCTTCGCGTGCAGTTGAAGAAATTGCGCTTGATATCTTA 

MVAKELGKNSFTIPTICSNCSAGTAIAVVYNDDHSFLRYGYPESPLHIFINTRIIA 

QAPSKYFWAGIGDG1SKAPEVERATLEAKTNKLPHTAVLGQAVALSSKEAFY 

QFGEQGLKDVEANLASRAVEEIALDIL 



Sequence description: 

A] Length: 405 bp - 135 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the ATG start codon, probable signal 
peptide present at the N-terminus. 



ID-95 
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Clone RS-73 



5 TTGAGGGAAACTTACTGGAAAATTTCAAGCGATTGCGATAAAATAAATCTT 
GCAGAGTTTTCTAGAGAAAGGAGGTCAGATTTATTGGAGTGGCAAGATCT 
AGCGCAGTTACCTGTATCTATTTTTAAAGACTATGTTACAGATGCTCAAGA 
CGCGGAAAAACCTTTTATATGGACAGAAGTATTTTTAAGGGAGATTAATCG 
CTCAAATCAAGAAATTATTTTGCATATTTGGCCGATGACTAAGACAGTCAT 

1 0 TCTGGGGATGTTAGATCG AGAATTACC AC ATTTAGAATTAGCTAAAA AAG 
AAATCATCAGTCGTGGTTATGAACCAGTTGTTCGGAATTTTGGAGGTCTCG 
CAGTTGTAGCTGATGAAGGAATTTTAAATTTTTCATTGGTTATTCCAGATGT 
TTTTGAGAGAAAATTGTCTATCTCAGATGGGTATCTTATAATGGTCGATTTT 
ATTAGAAGTATATTTTCGGATTTTTATCAACCTATTGAGCACTTTGAAGTA 

1 5 GAGACCTCCTATTGTCCTGGTAAGTTTGATCTTAGTATAAATGGCAAAAAA 
TTTGCTGGCTTGGCTCAGCGCCGTATAAAGAATGGTATTGCGGTATCAATT 
TACCTTAGCGTTTGTGGCGATCAAAAAGGGCGGAGTCAAATGATTTCAGAT 
TTTTATAAGATTGGTCTAGGTGATACGGGTAGTCCAATTGCTTATCCAAAT 
GTAGATCCTGAAATTATGGCTAATCTATCTGATCTATTAGATTGTCCTATG 

20 ACAGTAGAAGATGTTATTGATCGTATGTTGATTAGCCTTAAACAAGTAGGT 
TTTAATGATCGTTTACTGATGATTAGACCCGATTTAGTTGCAGAGTTTGAT 
AGATTTCAGGCTAAGTCTATGGCTAATAAGGGGATGGTGAGCAGAGATGA 
ATAA 

25 

MRETYWKISSDCDKINLAEFSRERRSDLLEWQDLAQLPVS1FKDYVTDAQDAE 
KPFIWTEVFLREINRSNQEIILHIWPMTKTVILGMLDRELPHLELAKKEIISRGYE 
PVVRNFGGLAVVADEGILNFSLVIPDVFERKLSISDGYLIMVDFIRSIFSDFYQPI 
EHFEVETSYCPGKFDLSINGKKFAGLAQRRIKNGIAVSIYLSVCGDQKGRSQMI 
30 SDFYKIGLGDTGSPIAYPNVDPEEMANLSDLLDCPMTVEDVIDRMLISLKQVGF 
NDRLLMIRPDLVAEFDRFQAKSMANKGMVSRDE* 



Sequence description: 

35 

A] Length: 921 bp -307 aa (Full-length gene sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the TTG start codon or signal peptide 
visible. Actual start point may be a further 

40 85 bp downstream (TTG). This start point is 

preceded by a typical Shine-Dalgarno sequence. 
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ID-96 



Clone RS-74 

5 

TTGGAAGGTTTACTTATTGCATTGATTCCCATGTTTGCGTGGGAAAGTATT 
GGATTTGTTAGTAATAAAATTGGAGGGCGTCCAAATCAACAAACATTTGG 
AATGACTTTAGGAGCATTGCTATTTGCGATTATCGTATGGTTATTTAAACA 

1 0 GCCAGAGATG ACTGCCTCATTGTGGATTTTTGGTATCTT AGGTGGT ATCCT 
ATGGTCAGTCGGCCAAAATGGTCAATTTCAAGCAATGAAATATATGGGAG 
TCTCTGTTGCTAATCCACTGTCAAGTGGTGCACAATTAGTAGGTGGAAGCC 
TAGTTGGTGCTTTAGTCTTTCATGAATGGACTAAGCCAATCCAATTTATTTT 
AGGATTGACAGCGTTGACATTATTAGTTATCGGCTTCTATTTCTCAAGTAA 

1 5 ACGTGATGTTTCAGAACAAGCTTTGGCAACACATCAAGAGTTTTCAAAAG 
GATTTGCTACAATTGCTTATTCAACTGTAGGTTACATCTCGTACGCAGTTTT 
ATTTAACAACATTATGAAGTTCGACGCTATGGCCGTCATTTTACCCATGGC 
TGTTGGAATGTGTCTAGGTGCAATTTGTTTCATGAAGTTTCGTGTTAACTTT 
GAGGCTGTTGTTGTTAAAAATATGATTACAGGTCTCATGTGGGGCGTTGGT 

20 AATGTCTTCATGTTATTGGCAGCAGCTAAAGCAGGGCTAGCAATTGCTTTT 
AGTTTTTCTCAACTTGGAGTAATTATCTCTATTATTGGTGGTATTTTATTTTT 
AGGTGAGACAAAAACGAAGAAAGAGCAGAAATGGGTTGTCATGGGTATC 
CTTTGTTTTGTTATGGGTGCTATATTACTTGGTATTGTTAAATCTTATTAA 

2 5 MEGLLI ALIPMF A WES IGF VSNKIGGRPNQQTFGMTLG ALLF AIIV WLFKQPEM 
TASLWIFGILGGILWSVGQNGQFQAMKYMGVSVANPLSSGAQLVGGSLVGAL 
VFHEWTKPIQFILGLTALTLLVIGFYFSSKRDVSEQALATHQEFSKGFATIAYST 
VGYISYAVLFNNIMKFDAMAVILPMAVGMCLGAICFMKFRVNFEAVVVKNMI 
TGLMWGVGNVFMLLAAAKAGLAIAFSFSQLGVIISIIGGILFLGETKTKKEQK 

30 WVVMGILCFVMGAILLGIVKSY* 



Sequence description: 

35 A] Length: 867 bp - 289 aa (full-length gene) 

B] Posible Shine Dalgarno sequence upstream of 
GTG start codon, no obvious signal peptide 
present. 

40 



TD-97 
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Clone RS-75 



ATGACAACTTACTACGAAGCTATAAACTGGAACGAAATTGAAGATGTTAT 
5 TGATAAATCAACTTGGGAAAAACTAACCGAACAATTTTGGCTCGATACAC 
GTATCCCTTTATCAAATGACTTAGACGATTGGCGCAAACTTTCCGCTCAAG 
AAAAAGATCTTGTTGGCAAGGTTTTTGGAGGCTTAACCCTACTTGATACCA 
TGCAATCAGAAACTGGTGTTGAAGCTATTCGTGCCGATGTTCGCACGCCTC 
ACGAAGAAGCTGTCTTAAACAATATTCAATTCATGGAATCTGTTCACGCTA 

1 0 AATCTTATTCTTCAATTTTCTCAACTTTAAATACTAAATC AGAAATTGAAG 
AAATTTTCGAGTGGACTAATAATAATGAGTTCCTTCAAGAAAAAGCACGT 
ATTATCAATGACATTTATGCTAATGGAAATGCCCTTCAAAAAAAGGTGGCT 
TCCACCTACCTCGAAACTTTCCTTTTTTATTCTGGCTTTTTCACACCTCTTTA 
CTATTTGGGAAATAATAAGTTAGCAAATGTTGCTGAAATCATTAAATTAAT 

1 5 TATTCGTGATG AATCTGTACATGGTACTTATATCGGTTAC AAATTCC AGCTT 
GGTTTTAACGAATTACCAGAAGATGAGCAAGAGAATTTTCGTGATTGGAT 
GTATGACCTCCTTTATCAGCTGTATGAAAACGAAGAAAAATACACCAAGA 
CACTTTATGATGGCGTAGGATGGACTGAAGAAGTTATGACCTTTTTACGCT 
ACAATGCTAATAAAGCTCTTATGAATTTAGGACAAGATCCTTTATTCCCAG 

20 ATACAGCAAATGATGTCAACCCAATTGTTATGAATGGTATTTCAACAGGAA 
CATCAAACCATGACTTCTTCTCTCAAGTAGGTAATGGTTACCTACTTGGTA 
GCGTTGAAGCTATGCATGATGATGACTATAACTATGGATTATAA 

MTTYYEA1NWNEIEDVIDKSTWEKLTEQFWLDTRIPLSNDLDDWRKLSAQEK 
25 DLVGKVFGGLTLLDTMQSETGVEAIRADVRTPHEEAVLNNIQFMESVHAKSY 
SSIFSTLNTKSEIEETFEWTNNNEFLQEKARI1NDIYANGNALQKKVASTYLETF 
LFYSGFFTPLYYLGNNKLANVAEIIKLIIRDESVHGTYIGYKFQLGFNELPEDEQ 
ENFRDWMYDLLYQLYENEEKYTKTLYDGVGWTEEVMTFLRYNANKALMNL 
GQDPLFPDTANDVNPIVMNGISTGTSNHDFFSQVGNGYLLGSVEAMHDDDYN 
30 YGL* 



Sequence description: 

35 A] Length: 960 bp - 320 aa (full length gene) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, but no signal peptide 
present. 

40 



ID-98 
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Clone RS-77 (partial sequence) 



ATGAATTGGTCACGTATCTGGGAACTCGTAAAAATTAATATCCTTTATTCA 
5 AACCCTCAGACTCTATCGGCACTAAGAAAAAAGCAAGAAAAGCATCCTAA 
AAAAGAATTTTCAGCTTATAAATCCATGTTTAGAAATCAGTTATTTCAGAT 
TTTGCTCTTTTCAATAATTTATGTATTTCTCTTTGTATCACTTGATTTTAAAG 
AATATCCGGGCTATTTCACGTTCTACATTGGTATCTTTACACTAGTATCCAT 
TATCTACTCTTTTATTGCGATGTACAGTGTTTTCTATGAGAGTGACGATGTT 
10 AA 

MNWSRIWELVKINILYSNPQTLSALRKKQEKHPKKEFSAYKSMFRNQLFQILL 
FSIIYVFLFVSLDFKEYPGYFTFYIGIFTLVSIIYSFIAMYSVFYESDDV 



15 



25 



40 



Sequence description: 



A] Length: 311 bp - 103 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of 
20 ATG start codon, no obvious signal peptide at 

N-terminus. 



1D-99 



Clone RS-78 (partial sequence) 



TAATCTTTTAGTCAACGGAGCAACAGGAAAATTGCAGGCTATGCGACAGA 
30 TATTCCACCACATAATTTAGCAGAAGTCATTGATGCTGTCGTGTACATGAT 
TGATCACCCTAAAGCTAAATTAGATAAATTAATGGAATTTCTACCTGGTCC 
AGATTTTCCAACTGGCGCTATCATTCAAGGAAAAGATGAAATTCGTAAGG 
CATATGAGACTGGTAAGGGGAGAGTAGCGGTTCGCTCGCGAACTGCTATT 
GAAACCTTAAAAGGTGGTAAGAAACAAATTATTGTTACTGAAATTCCTTAT 
35 GAAGTTAAT 

SFSQRSNRKIAGYATDrPPHNLAEVIDAVVYMIDHPKAKLDKLMEFLPGPDFPT 
GAIIQGKLDEIRKAYETGKGRVAVRSRTAIETLKGGKKQIIVTEIPYEVN 



Sequence description: 



A] Length: 312 bp - 104 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence or a 
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signal peptide. Both N- and C- termini of ORF 
yet to be elucidated. 

5 ID-100 



Clone RS-79 

10 

ATGGGACGTAAGTGGGCCAATATTGTTGCCAAAAAGACTGCTAAAGATGG 
TGCTAACTCAAAAGTATACGCTAAATTCGGTGTTGAAATATATGTTGCTGC 
AAAGCAAGGTGAACCAGACCCCGAGTCAAACTCAGCTCTAAAATTCGTTT 
TGGACCGTGCTAAGCAAGCACAAGTTCCAAAGCATGTTATTGATAAAGCG 

1 5 ATTGATAAAGCC AAAGGAAAC AC AGATGAAACTTTCGTAGAGGGACGCTA 
TGAAGGTTTTGGTCCAAATGGTTCAATGATTATTGTGGATACTTTGACATC 
AAATGTTAACCGTACGGCAGCAAATGTACGTACTGCTTACGGTAAGAACG 
GTGGCAATATGGGAGCTTCAGGATCGGTATCCTACTTATTTGATAAAAAAG 
GTGTCATCGTTTTTGCTGGTGATGATGCTGACACTGTCTTCGAACAATTACT 

20 TGAAGCGGATGTAGACGTAGATGATGTTGAAGCAGAAGAGGGAACAATA 
ACAGTTTATACCGCCCCAACAGATCTTCATAAAGGTATCCAAGCACTTCGC 
GATAATGGTGTAGAAGAATTCCAAGTTACTGAACTTGAAATGATTCCTCAA 
TCAGAAGTAGTATTGGAAGGTGATGACCTTGAAACTTTTGAAAAGCTT 

25 MGRKWANIVAKKTAKDGANSKVYAKFGVEIYVAAKQGEPDPESNSALKFVL 
DRAKQAQVPK.HVIDKAIDKAKGNTDETFVEGRYEGFGPNGSM11VDTLTSNV 
NRTAANVRTAYGKNGGNMGASGSVSYLFDKKGVIVFAGDDADTVFEQLLEA 
DVDVDDVEAEEGTITVYTAPTDLHKGIQALRDNGVEEFQVTELEMIPQSEVVL 
EGDDLETFEKL 

30 

Sequence description: 

A] Length: 654 bp - 21 8 aa (Partial sequence) 
35 B] Possible Shine Dalgarno sequence upstream 

of ATG start, no obvious signal peptide 



40 ID-101 



Clone RS-80 



23/98 
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TTGGAGAAATATTTGAAGAACCCGATTACATGGATTGGATTAGTTCTTGTG 
GTTACGTGGTTTTTAACTAAAAGTAGTGAATTTTTGATTTTTGGTGTGTGTG 
TCTTGTTGTTAGTATTTGCTAGTCAAAGTGAT 

5 

MEKYLKNPITWIGLVLVVTWFLTICSSEFLIFGVCVLLLVFASQSD 



10 



15 



20 



Sequence description: 



A] Length: 135 bp - 45 aa (partial sequence) 

B] Shine Dalgarno sequence upstream of TTG 
start codon with possible signal peptide 
evident at N-terminus. 



ID- 102 



Clone RS-81 



ATGACACAATCAGATGCATATCTCTCGTTGAACGCGAAGACACGCTTTAGA 
25 GATCGCACAGGTAATTATCATTTTACTTCGGATAAAGAGGCTGTTGAACAA 
TATATGATAGAACATGTTGAACCTAATACGATGGTGTTCACATCACTAATT 
GAAAAGCTAGATTATTTGGTTTCTAATAACTACTATGAATCGGACCTTCTA 
AAACAATATAACCTTGAGTTTATTTGCCAAATTTTTGAGCATGCATACGCT 
AAGAAATTTGCTTTTCTAAATTTTATGGGGGCTTTAAAATTTTATAATGCTT 
30 ATGCTCTTAAT 



MTQSDAYLSLNAKTRFRDRTGNYHFTSDKEAVEQYMIEHVEPNTMVFTSLIE 
KLDYLVSNNYYESDLLKQYNLEFICQIFEHAYAKKFAFLNFMGALKFYNAYA 
35 LN 



Sequence description: 



40 



A] Length: 318 bp - 106 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, no obvious signal peptide 
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ID- 103 



Clone 2-11A 

5 

ATGGTATTTATGGCAAATAAGAAAAAAACAAAAGGAAAGAAAACCAGAA 
GACCTACTAAGGCAGAAATAGAGCGTCAAAGAGCTATTCAAAGGATGATT 
ACTGCTCTTGTTTTAACAATTATTCTCTTCTTTGGTATTATCAGATTAGGTA 

1 0 TTTTTGGTATT AC AGTCTAT AACGTC ATCCGTTTTATGGTAGGTAGCTTGGC 
TTACTTATTTATTGCGGCAACTTTAATCTACCTTTATTTCTTTAAATGGTTG 
CGAAAGAAAGATAGCTTAGTAGCAGGTTTTTTGATAGCTTCTTTAGGATTA 
TTGATTGAGTGGCATGCTTACCTTTTCTCAATGCCTATTTTGAAAGATAAA 
GAAATTTTGCGTTCAACTGCTCGATTAATTGTGTCTGATTTAATGCAATTTA 

1 5 AAATCACTGTTTTTGCCGGTGGAGGTATGTTGGGTGCTTTGATTTACAAGC 
CAATTGCTTTTCTCTTTTCTAATATTGGTGCCTATATGATTGGTGTTCTCTTC 
ATCATTTTGGGTCTCTTTTTAATGAGTTCTCTGGAAGTTTATGACATCGTCG 
AATTTATTAGAGCTTTTAAAAATAAAGTGGCAGAGAAGCACGAGCAAAAT 
AAAAAGGAGCGTTTTGCTAAGCGAGAGATGAAAAAAGCAATCGCTGAACA 

20 AGAGCGCATAGAGCGTCAAAAAGCTGAAGAAGAAGCTTATTTAGCTTCGG 
TTAATGTAGACCCTGAAACGGGTGAGATTCTAGAGGATCAAGCTGAGGAC 
AATTTGGATGATGCGCTACCACCTGAGGTAAGTGAAACATCAACTCCGGT 
ATTTGAGCCAGAGATCCTTGCTTATGAGACATCGCCTCAAAATGATCCTTT 
ACCAGTAGAGCCGACAATTTATTTAGAAGACTATGATTCGCCGATTCCTAA 

2 5 T ATG AGAG A A A ATG ATG AGG A A ATGGTTT ATG ATTT AG ATG ATG ATGT AG 

ATGATAGTGATATAGAAAATGTCGACTTTACACCTAAAACGACACTGGTTT 
ATAAATTACCAACGATAGATTTATTTGCACCAGATAAGCCTAAAAATCAAT 
CCAAAGAAAAGGATTTAGTCCGAAAGAATATCAGAGTTTTAGAAGAAACA 
TTTAGAAGTTTTGGTATCGATGTAAAAGTAGAACGTGCTGAAATTGGACCA 
30 TCAGTTACTAAATATGAAATTAAACCAGCAGTTGGAGTTCGTGTGAATCGT 
ATTTCAAATCTATCTGACGACCTAGCTCTTGCTCTTGCAGCAAAAGATGTG 
CGTATAGAAGCACCAATTCCTGGAAAATCATTAATAGGTATTGAAGTTCCT 
AACTCAGAAATTGCAACGGTTTCTTTCCGCGAACTTTGGGAACAATCTGAT 
GCCAATCCTGAAAACCTTTTAGAAGTACCACTAGGAAAAGCTGTTAACGG 

3 5 C AATGCTCGC AGTTTTAACTTAGCTAGAATGCCGCATCTTTTGGTAGCTGG 

TTCAACTGGTTCAGGTAAATCTGTGGCAGTTAATGGAATTATTTCAAGTAT 
TTTGATGAAGGCACGTCCAGATCAAGTTAAGTTTATGATGATTGATCCCAA 
AATGGTTGAATTATCTGTTTATAATGATATTCCACATTTATTAATCCCTGTT 
GTAACCAATCCGCGTAAAGCAAGTAAGGCACTCCAAAAAGTTGTTGATGA 
40 AATGGAAAATCGATACGAGTTATTTAGCAAAATTGGTGTGCGTAATATAG 
CAGGTTATAATACAAAGGTTGAAGAGTTTAATGCTTCCTCTGAGCAAAAAC 
AAATGCCTTTGCCTTTAATCGTTGTCATTGTAGATGAATTGGCTGACTTGAT 
GATGGTTGCTAGTAAAGAAGTTGAAGATGCTATTATTCGTTTGGGGCAAAA 
AGCACGTGCTGCAGGTATCCATATGATTCTTGCAACTCAACGTCCATCCGT 
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AGATGTTATTTCTGGTTTGATTAAAGCAAATGTTCCGTCGCGTATTGCATTT 
GCTGTTTCAAGTGGTACTGATAGCCGTACGATCCTTGATGAAAATGGTGCT 
GAAAAGCTCTTGGGACGGGGTGACATGCTCTTTAAGCCTATTGATGAGAAT 
CATCCAGTACGACTACAAGGTTCCTTTATTTCAGATGATGATGTTGAAAGG 
5 ATCGTTGGTTTTATCAAAGACCAAGCCGAGGCTGACTATGATGATGCCTTT 
GATCCTGGAGAAGTATCTGAAACAGATAACGGCTCTGGTGGTGGCGGCGG 
AGTACCTGAAAGTGATCCTCTTTTTGAAGAAGCCAAGGGACTCGTTTTAGA 
GACGCAAAAAGCAAGTGCCTCAATGATTCAACGCCGATTGTCTGTTGGTTT 
CAATAGAGCAACAAGACTAATGGAAGAATTAGAAGCAGCGGGGGTTATTG 
1 0 GTCC AGCAG A AGG AACC A AGCC ACG AAAAGTTTT AATG ACTCC AACTCCG 
AGTGAATAA 

MVFMANKKKTKGKKTRRPTKAEIERQRAIQRMITALVLTIILFFGIIRLGTFGIT 
VYNVIRFMVGSLAYLFIAATLIYLYFFKWLRKKDSLVAGFLIASLGLLIEWHA 

1 5 YLFSMPILKDKEILRSTARLIVSDLMQFKITVFAGGGMLGALIYKPIAFLFSNIG 
AYMIGVLFIILGLFLMSSLEVYDIVEFIRAFKNKVAEKHEQNKFCERFAKREMK 
KAIAEQERIERQKAEEEAYLASVNVDPETGEILEDQAEDNLDDALPPEVSETST 
PVFEPEILAYETSPQNDPLPVEPTIYLEDYDSPIPNMRENDEEMVYDLDDDVDD 
SDIENVDFTPKTTLVYKLPTIDLFAPDKPKNQSKEKDLVRKNIRVLEETFRSFGI 

20 DVKVERAEIGPSVTKYEIK.PAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGK 
SLIGIEVPNSEIATVSFRELWEQSDANPENLLEVPLGKAVNGNARSFNLARMPH 
LLVAGSTGSGKSVAVNGIISSILMKjVRPDQVKFMMrDPKMVELSVYNDIPHLLl 
PVVTNPRKASKALQKVVDEMENRYELFSKIGVRNIAGYNTKVEEFNASSEQK 
QMPLPLIVVIVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVD 

25 VISGLIKANVPSR1AFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRL 
QGSFISDDDVERIVGFIKDQAEADYDDAFDPGEVSETDNGSGGGGGVPESDPL 
FEEAKGLVLETQKASASMIQRRLSVGFNRATRLMEELEAAGV1GPAEGTKPRK 
VLMTPTPSE* 

30 

Sequence description: 

A] Length: 2451 bp - 817 aa (Full-length gene) 

B] Shine Dalgarno sequence present upstream of 
35 ATG start codon, possesses a potential signal 

peptide 



ID- 104 

40 



Clone 2-18/22b 



ATGTCACAAGAGCAAGGAAAAATTTATATTGTAGAAGATGATATGACGAT 
TGTGTCACTTTTAAAAGATCATTTATCAGCTAGCTATCATGTCTCTAGTGTC 
AGCAATTTTCGTGATGTGAAACAAGAAATTATCGCATTTCAACCCGATTTG 
ATACTAATGGATATTACGTTACCCTATTTTAATGGTTTTTACTGGACTGCAG 
5 AATTGCGTAAGTTTTTAACAATTCCTATTATTTTCATTTCATCTAGTAATGA 
TGAAATGGATA TGGTTATGGCATTAAATATGGGGGGTGATGACTTTATTTC 
AAAACCATTCTCTCTAGCTGTATTAGATGCTAAGCTAACTGCTATTTTAAG 
GAGAAGTCAACAATTTATCCAACAGGAATTAACTTTTGGGGGATTTACGTT 
GACAAGAGAAGGGTTATTGTCTAGCCAAGATAAAGAGGTTATTTTATCGC 
1 0 CAAC AGAAAATA AAATCCTATCTATCTTGCTC ATGC ATCCTAAAC AAGTAG 
TCTCAAAAGAGTCTCTATTAGAGAAACTTTGGGAAAATGATAGTTTTATTG 
ATCAAAATACACTTAATGTTAATATGACACGCTTACGTAAAAAAATTGTCC 
CAATAGGTTTTGATTACATTCATACAGTGAGAGGAGTTGGGTATTTACTAC 
AATGA 

15 

MSQEQGKIYIVEDDMTIVSLLKDHLSASYHVSSVSNFRDVKQE1IAFQPDLILM 
DITLPYFNGFYWTAELRKFLT1P1IF1SSSNDEMDMVMALNMGGDDFISKPFSLA 
VLDAKLTAILRRSQQFIQQELTFGGFTLTREGLLSSQDKEVILSPTENKILSILLM 
20 HPKQVVSKESLLEKLWENDSFIDQNTLNVNMTRLRKKIVPIGFDY1HTVRGVG 
YLLQ* 



Sequence description: 

25 

A] Length: 669 bp - 223 aa (full-length gene 
sequence) 

B] Shine Dalgarno sequence present upstream of a GTG start codon. 
Was not identified directly by LEEP. This gene was found upstream of 

30 gene ID- 10 described in WO 00/06736. 



ID- 105 
35 Clone 2-20 



ATGTATCAAACTCAGACAAATAAGGAAAAATTTGTTTTATTTTTGAAATTA 
TTTATCCCAGTATTGATTTATCAATTTGCTAATTTTTCAGCTACTTTTATTGA 
40 TTCGGTTATGACTGGACAGTATAGTCAGCTACATTTGGCAGGTGTGTCAAC 
TGCTAGTAATTTATGGACTCCGTTTTTCGCTTTATTAGTAGGTATGATTTCA 
GCATTAGTACCAGTAGTTGGTCAACATTTGGGTAGAGGAAATAAAGAACA 
AATTCGCACAGAATTTCATCAATTTCTATATTTAGGTTTGATACTGTCCTTA 
ATATTATTTTTAATCATGCAATTTATTGCTCAACCTGTCTTGGGGAGTTTGG 
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GTTTAGAAGATGAAGTTCTAGCAGTTGGTCGTGGTTATTTAAATTATATGT 
TGATTGGAATCATGCCGCTGGTGTTGTTTAGCATTTGCCGTTCATTCTTTGA 
TGCATTGGGGTTAACAAGGTTATCTATGTATCTGATGCTTTTAATTCTACCC 
TTTAATTCATTTTTTAATTATATGCTTATCTACGGTAAATTTGGTATGCCTA 
5 GACTAGGAGGTGCGGGGGCAGGTCTTGGAACTTCTTTAACTTATTGGGCTA 
TTTTTATTGGTATTATTATTGTGATGTCACTTCATCCTCAAATTAAAACATA 
TCATATATGGACTCTGGAAAGAATAAAAGCTCCTTTGATTATTGAAGATAT 
TCGATTGGGATTACCGATTGGTTTACAAATTTTTGCAGAAGTTGCAATTTTT 
GCAGTAGTAGGCTTATTCATGGCAAAATTTTCTTCAATCATTATTGCAGCA 

1 0 C ATC AGGCTGCTATGAATTTTTCATCATTAATGTATGCATTTCCTTTAAGTA 
TTTCCACTGCTCTAGCTATTACAATATCGTTTGAAGTAGGGGCAGAGCGCT 
TTCAGGACGCAACCACTTATAGTAGGATAGGACGCTTAACAGCGGTAGGG 
ATTACATCAGGAACCTTACTATTTTTATTTCTATTTCGTGAGAATGTAGCAG 
CAATGTATAATAGTGCCCCTCACTTTGTCGCTATTACAGCTCAATTCCTAAC 

1 5 TTATAGTCTCTTTTTCCAGTTTGCAG ATGCTTATGCAGCTCCTGTACAGGGG 
ATTTTACGAGGCTATAAGGATACAACAAAACCATTTATGATCGGTGCGGG 
CTCTTATTGGTTATGTGCTTTGCCATTAGCGGTTATCTTAGAAAAAAATAG 
CCAGTTAGGTCCGTTTGCCTATTGGATTGGTTTAATCACAGGTATTTTTGTT 
TGTGGTCTATTTCTAAACCAACGTCTGCAAAAGATTAAGAAGTTGTATTAT 

20 TAA 

MYQTQTNKEKFVLFLKLFIPVLIYQFANFSATF1DSVMTGQYSQLHLAGVSTAS 

NLWTPFFALLVGMISALVPVVGQHLGRGNKEQIRTEFHQFLYLGLILSLTLFLI 

MQFIAQPVLGSLGLEDEVLAVGRGYLNYMLIGIMPLVLFSICRSFFDALGLTRL 

2 5 SM YLMLLILPFNSFFN YMLIYGKFGMPRLGG AG AGLGTSLT Y W AIFIGIIIVMS 
LHPQIKTYHIWTLERIKAPLIIEDIRLGLPIGLQIFAEVAIFAVVGLFMAKFSSIIIA 
AHQAAMNFSSLMYAFPLSISTALAITISFEVGAERFQDATTYSRIGRLTAVGITS 
GTLLFLFLFRENVAAMYNSAPHFVAITAQFLTYSLFFQFADAYAAPVQGILRG 
YKDTTKPFMIGAGSYWLCALPLAVILEKNSQLGPFAYWIGLITGIFVCGLFLNQ 

30 RLQKIKKLYY* 



Sequence description: 

35 

A] Length: 1341 bp - 447 aa (full length gene) 

B] Shine-Dalgarno sequence present upstream of 
ATG start codon, There is a potential signal 
peptide sequence 

40 



ID- 106 
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Clone 2-4A 



TTGCTAGTTTCTTCTCTAGTTTCTTGTTCATTTTTTCTTGTCATTTCGTCGTT 

GTCTTCATCAACACGAAATAAGTCTATAAACTTATCAAATAATTTCATAGA 

CTTATTATATCAATTTTCAATAAAATGCTATAATAAAACCATGTCATTTTCA 

TTAAAAATTAGAAATCCATACGGTGAACATACCGTTAAAGAACTCCTTGA 

AGATTATTTTTTGATTCCACGTAAGATTAGACATTTTTTGCGTGTTAAAAAA 

CATGTACTTATAAACAATGAATTCATTAATTGGCAAACTGTCGTCCAAGAA 

AACGATACTATTACCTTAATCTTTGATGATGAGGATTACCCTACTAAAAAA 

ATTCCTCTGGGCAGAGCAGAGCTTATTGATTGTCTTTATGAGGATGAACAT 

CTTATTATCGTTAATAAACCTGAAGGTATGAAAACTCACGGTAACCAACCA 

AATGAAATAGCACTGTTAAATCATGTATCTGCCTATTCTGGACAAACATGC 

TATGTTGTTCATCGCCTAGATATGGAGACCAGTGGAGCTGTTTTATTTGCT 

AAAAATCCATTTATACTTCCCCTTATCAATCAACGCTTAGAACGAAAAGAA 

ATTTGGCGTGAATATTGGGCTTTAGTTGAAGGAAAATTTTCACCTAAGCAT 

CAAGTTTTGAGAGACAAAATTGGACGGAACCGTCATGACAGACGTAAACG 

AATCATTGATTCTAAAAACGGTCAACATGCTATGACAATCATTGACGTTTT 

GAAGTATATCCAAAATAGTAGTCTCATAAAATGCCGACTGGAAACCGGAA 

GAACCCATCAAATTCGCATTCACTTATCTCATCACGGACATCCTTTAATAG 

GAGATCCCCTCTACAACCCTTCTTCTAATAATGAAAGGTTAATGCTACACG 

CTCACCGATTGACTCTATCCCATCCATTAACTTGCGAAACTATTAGCGTAG 

AGGCCCCTTCATCTACTTTCGAGAAGGTTTTAAACAATTATAAAAAAGGAG 

TTGGATAA 

MLVSSLVSCSFFLVISSLSSSTRNKS1NLSNNF1DLLYQFSIKCYNKTMSFSLK1R 

NPYGEHTVKELLEDYFLIPRKIRHFLRVKKHVLINNEFINWQTVVQENDTITLIF 

DDEDYPTKKIPLGPvAELIDCLYEDEHLriVNKPEGMKTHGNQPNEIALLNHVSA 

YSGQTCYVVHRLDMETSGAVLFAKJSfPFILPLINQREEPvKEIWPvEYWALVEGKF 

SPKHQVLRDKIGRNRHDRRKRIIDSKNGQHAMTIIDVLKYIQNSSLIKCRLETG 

RTHQIRIHLSHHGHPLIGDPLYNPSSNNERLMLHAHRLTLSHPLTCETISVEAPS 

STFEKVLNNYKKGVG* 



Sequence description: 

A] Length: 1029 bp - 343 aa (Full length gene sequence) 

B] No obvious Shine-Dalgarno sequence upstream 
of the putative TTG start codon. Possesses a 
potential leader peptide sequence. 
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ID- 107 



Clone 2-54 

5 

GAACTAAATGCAACTCAACCTAATAATAGAACTACCTATATTATACCCGAA 

AGCAGTCATTCCATTGCAGAACAACAGAGATTCCTGATAGAATCAAAGGG 

TTCTTCGGTTGCATTACTTAATAGCGATGAATTTAGAAAGACAGCGGGAGA 

1 0 GG ATAG AGGTTTTG A AAGGGATAAGTTGAGGTCTTTGG ATATC ATTCCTAA 
GGGAGATTTATCGACAAGTAATGTCATAGGTAATACGGACATTGCTAGTC 
AGATATCGTTGGGCTTTAAAAAGAATGCGATGCAGGAACACCATCTTACT 
AAAACATTCTCTCAAAAGGATGGAAAGTTATCGTCTGTTATAGAGGGGAT 
GCTTGCTATTGGCAAAGAGAAAGTAGAGAAAGAAATAAAATATAGTGGTA 

1 5 ATTTATGGCAAAAATTAAAAGCTAAGGCACACTGCCTTGTTTGCTGTGTTG 
ATAATTTGAATTTTGAAGATATAAAATCTTATTTTCAATATTATTGTCATCT 
AAACCATCAGCTCAAATTACCTAAAGGTGCTATACTTTCTGCTAAAACAGA 
AGTATATAGGGGAGGAGATTTTGGGAGAAAAAATAAAGATAATGTGTTTG 
GTTACCGTATCCCCTCATTATTGAAAACCCAAAAAGGAACTTTACTTGCGG 

20 GAGCTGATGAAAGAATTGAGCAAGCTTGTGATTGGGGAAACATAGGAATG 
GTTATTCGCCGTAGTGAGGATGATGGTGTCACTTGGGGAAAAAGAGAAAC 
TATTGTCAATCTCCGTAATAACCCTAGAGTTCCGCTAGTTACTAGTGGTGA 
CTATAGTGGCTCACCTATTAATATGGATATGGCATTAGTTCAAGATACTAG 
CTCCAAGACGAAACGTATTTTTTCAATATATGATATGTTTCCAGAAGGAAG 

25 AGGCGTTATTAGTATTGCTAACACACCTGAAAAAGAATATACCCAAATCG 
GAGGACAGTCTTATCTTAATTTATATAATAATGGAAAGAAATCGAAGGTTT 
TTACTATCCGTGACAAAGGTATTGTATATAATTTTAAAGGGAAAAAGACTG 
ATTATCATGTTATAACAGAAACTACTAAAAGTGACCATTCAAATCTAGGGG 
ATATTTATAAGGGAAAACAGCTACTTGGAAATATATATTTTACAAAACATA 

30 AAACGTCACCATTTCGTTTAGCAAAATCAAGCTATGTGTGGATGTCATATA 
GCGATGATGATGGTAGGACATGGTCATCACCTAGAGATATAACAGCAAGT 
CTTCGTCAGAAAGGCATGAAATTTTTGGGAATAGGACCTGGAAAAGGTAT 
AGTTTTAAAATGGGGGCCACACGCTGGTCGTATTATTATTCCTGCCTATTCT 
ACGAATTGGAAATCTCATCTAAGAGGTTCACAATCTTCACGCCTAATTTAT 

3 5 TC AG ACGACC ATGG AAA AACGTGGCATACTGGAAAAGC AGTTAATG ATAA 
CCGTATACTTTCTAATGGTGAAAAAATTCACTCCTTAACAATGGATAATAA 
AAAAGAACAAAATACAGAATCCGTACCCGTTCAATTGAAAAATGGGGACA 
TTAAGTTATTTATGAGGAATCTAACTGGTAACCTAGAAGTAGCCACAAGTA 
AAGACGGCGGGGAGACTTGGCAAAACCATGTTAAACGATATAAGGAAATT 

40 CATGATGCTTACGTCCAACTATCAGCTATTCGCTTTGAGCATGACAAAAAA 
GAGTATATTTTATTAGTGAATGCTAATGGGCCAGGGAAGAAGTGCCAAGA 
TGGATATGCACGTCTAGCGCAAGTTAATCGAAATGGTAGTTTTAAGTGGTT 
ATATCACCATCACATTCAAGATGGTTCGTTTGCTTACAACTCTGTTCAACA 
ACTTAATAATGATCAATTTGGTGTCCTTTATGAACATAGAGAAAAACATCA 
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AAATAGTTTTACTTTAAATTACAAAGTTTTTAATTGGAGTTTTCTTAGTCAA 
AATACAGAGAAGCAAGGCACTTTATGGGAGAAAATGGCAGCAAATTGGCA 
TGTTTTGTTTAAATTTTATTTATGA 

5 

ELNATQPNNRTTYIIPESSHSIAEQQRFLIESKGSSVALLNSDEFRKTAGEDRGF 
ERDKLRSLDIIPKGDLSTSNVIGNTDIASQISLGFKFCNAMQEHHLTKTFSQKDG 
KLSSVTEGMLArGKLEKVEKEIKYSGNLWQKLKAKAHCLVCCVDNLNFEDTKS 
YFQYYCHLNHQLKLPKGAILSAKTEVYRGGDFGRFGMKDNVFGYRIPSLLKTQ 

10 KGTLLAGADERIEQACDWGNIGMV1RRSEDDGVTWGKRETIVNLRNNPRVPL 
VTSGDYSGSPINMDMALVQDTSSKTKRIFSIYDMFPEGRGVISIANTPEKEYTQT 
GGQSYLNLYNNGKKSKVFTIRDKGIVYNFKGKKTDYHVITETTKSDHSNLGDI 
YKGKQLLGNIYFTKHKTSPFRLAKSSYVWMSYSDDDGRTWSSPRDITASLRQ 
KGMKFLGIGPGKGIVLKWGPHAGRIIIPAYSTNWKSHLRGSQSSRLIYSDDHG 

1 5 KTWHTGKAVNDNRILSNGEKIHSLTMDNKKEQNTES VPVQLKNGDIKLFMRN 

LTGNLEVATSKDGGETWQNHVKRYKEIHDAYVQLSAIRFEHDKKEYILLVNA 

NGPGKKCQDGYARLAQVNRNGSFKWLYHHHIQDGSFAYNSVQQLNNDQFG 

VLYEHREKHQNSFTLNYKVFNWSFLSQNTEKQGTLWEKMAANWHVLFKFYL 
* 

20 

Sequence description: 

A] Length: 2052 bp - 684 aa (partial gene sequence) 

B] N-terminus has yet to be determined 

25 

ID- 108 



30 Clone 2-61 



ATGCCTAAATTAATCGTATCTTTCCTCTGCATTTTATTATCCCTGACTTGTG 
TAAACTCTGTGCAAGCTGAAGAACATAAAGATATTATGCAAATTACCCGA 

3 5 G AAGCCGG ATATG ATGTT AA AG AT ATTAAT AA ACCTAA AGCGTCTATCGTT 
ATTGACAATAAAGGTCATATTTTGTGGGAAGATAACGCCGATTTAGAACGT 
GATCCCGCTAGCATGTCTAAAATGTTTACTTTATATTTACTATTTGAAGACT 
TAGCTAAAGGAAAAACAAACCTCAACACCACAGTGACTGCAACAGAAACA 
GACCAAGCCATAAGTAAGATTTATGAAATTAGTAATAACAATATTCATGCT 

40 GGGGTTGCTTATCCTATTCGTGAACTGATTACTATGACGGCTGTCCCGTCA 
TCTAATGTAGCAACTATTATGATTGCTAACCACTTATCACAAAACAATCCT 
GACGCCTTTATTAAACGAATCAATGAAACCGCCAAGAAACTCGGTATGAC 
AAAAACTCACTTTTATAACCCCAGTGGGGCGGTAGCGAGTGCTTTTAATGG 
ACTTTACTCCCCAAAAGAATACGATAACAATGCTACTAACGTTACGACTGC 



ACGTGATCTATCAATTTTAACCTATCATTTCCTTAAAAAATACCCTGATATA 

CTGAACTATACAAAATATCCTGAAGTCAAGGCCATGGTCGGAACTCCTTAT 

GAAGAAACATTTACAACTTATAACTACTCTACCCCCGGCGCTAAATTTGGA 

TTAGAAGGAGTAGATGGCTTAAAAACTGGTTCTAGCCCTAGCGCTGCTTTT 

AATGCCTTAGTTACAGCTAAACGCCAGAATACTCGCTTGATAACTGTGGTT 

TTAGGAGTTGGCGATTGGTCAGACCAAGACGGAGAGTACTATCGTCATCC 

GTTTGTCAACGCTCTTGTAGAAAAAGGTTTTAAAGACGCTAAAAATATTTC 

TTCTAAAACTCCTGTATTAAAAGCCGTTAAACCTAAAAAAGAAGTTACTAA 

AACCAAAACTAAATCTATTCAAGAACAGCCTCAAACAAAAGAACAGTGGT 

GGACAAAAACAGATCAATTTATCCAATCACATTTTGTATCTATTTTAATTG 

TTCTGGGCACCATCGCTAGCCTTTGTCTTTTAGCTGGGATAGTATTACTTAT 

AAAGCGCTCTAGATAA 

MPKLIVSFLCILLSLTCVNSVQAEEHKDIMQITREAGYDVKDINKPKASIVIDN 

KGHILWEDNADLERDPASMSKMFTLYLLFEDLAKGKTNLNTTVTATETDQAI 

SKIYEISNNNIHAGVAYPIRELITMTAVPSSNVATIMIANHLSQNNPDAFIKRINE 

TAKXLGMTKTHFYNPSGAVASAFNGLYSPKEYDNNATNVTTARDLSILTYHF 

LKKYPDILNYTKYPEVKAMVGTPYEETFTTYNYSTPGAKFGLEGVDGLKTGS 

SPSAAFNALVTAKRQNTRLITVVLGVGDWSDQDGEYYRHPFVNALVEKGFK 

DAKNISSKTPVLKAVKPKKEVTKTKTKSIQEQPQTKEQWWTKTDQFIQSHFVS 

ILIVLGTIASLCLLAGIVLLIKRSR* 



Sequence description: 

A] Length: 1 1 88 bp - 396 aa (full length gene) 

B] Shine Dalgamo sequence present upstream of 
ATG start codon, possesses a potential signal 
peptide 



ID- 109 



Clone 45 



ATGACTGAAAAATATTATAATTGGGCAACGCTTGGAACCGGCGTTATTGCC 

AACGAATTAGCCCAAGCACTGGAAGCACGTGGACAAAAATTATATTCTGT 

AGCTAATAGAACTTACGACAAAGGACTTGAATTTGCTAACAAATATGGTA 

TCCAAAAAGTTTATGATCACATAGATCAAGTATTTGAAGACCCTGAAGTGG 

ATATCATTTATATCTCTACTCCCCACAATACTCACATCTCATTTTTACGAAA 
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GGCTTTAGCAAATGGTAAGCACGTTCTTTGCGAAAAATCTATTACTTTAAA 
TAGTACTGAGCTTAAAGAAGCCATAGATTTAGCCGAAACTAACCATGTTGT 
CTTAGCTGAAGCCATGACTATTTTTCATATGCCAATTTACCGCCAATTAAA 
AACATTAGTTGATAGTGGAAAATTAGGACCGTTAAAAATGATTCAAATGA 
5 ATTTCGGAAGTT ATA AAGAATATGATATG ACTA ACCGTTTTTTCAGTCGTG 
ACCTAGCAGGCGGTGCTTTGCTGGACATTGGTGTTTATGCACTTTCTTGTAT 
TCGCTGGTTTATGTCAGAAGCACCTCACAACATTACCTCTCAAGTTACATT 
TGCACCAACAGGGGTTGATGAACAAGTTGGTATCCTACTAACCAACCCAG 
CAAATGAGATGGCGACTGTCAGCCTTAGTTTACATGCAAAACAACCTAAA 

1 0 CGAGC AACTATCGCTTACGATAAAGGCTACATTGA ACTTTTTGAATATCCG 
CGAGGACAAAAGGCAGTTATTACTTATACTGAGGATGGGCATCAAGATAT 
TATCGAAGCTGGCAAAACTGAAAATGCTCTCCAATATGAGGTAGCTGATA 
TGGAAGAAGCCATTTCAGGAAAAACTAACCACATGTACTTAAACTATACC 
AAAGATGTTATGGATATCATGACACAGCTACGTCAAGAATGGGGATTTAC 

1 5 CTACCCAGAAGAAGAAAAATG A 

MTEKYYNWATLGTGVIANELAQALEARGQKLYSVANRTYDKGLEFANKYGI 
QKVYDH1DQVFEDPEVDIIYISTPHNTHISFLRKALANGKHVLCEKSITLNSTEL 
KEAIDLAETNHVVLAEAMTIFHMPIYRQLKTLVDSGKLGPLKMIQMNFGSYK 
20 EYDMTNRFFSRDLAGGALLDIGVYALSCIRWFMSEAPHNITSQVTFAPTGVDE 
QVGILLTNPANEMATVSLSLHAKQPKPvATIAYDKGYIELFEYPRGQKAVITYT 
EDGHQDIIEAGKTENALQYEVADMEEAISGKTNHMYLNYTKDVMDIMTQLR 
QEWGFTYPEEEK* 

25 

Sequence description: 

A] Length: 984 bp - 328 aa (full length gene) 

B] Shine Dalgarno sequence present upstream of 
30 ATG start codon, possesses a potential signal 

peptide 



35 ID-110 



Clone 2-2 

40 

GTGTATTCTCCTGTTAAATCTTCTAAAGGAAAAGTGATATTGTTAAAAAGT 
GATTTTCTAAAGAGCTTCATAGAAAGGAGAGGAAATATTTGTTTT 

MYSPVKSSKGKVILLKSDFLKSFIERRGNICF 
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Sequence description: 

5 A] Length: 96 bp - 32 aa (partial sequence) 

B] GTG start codon - no obvious Shine-Dalgarno 
sequence 

Possesses a potential signal peptide 

10 

ID-1 1 1 



15 Clone 2-3 



AAATACTGTATCATTGCAACCTCAAATGCAGGTTTTGGAAACGAAGCATTT 
ACAGGTGACAGCGATAAAGACTTGAAAATTATGGAACGAATTTCTCCATA 

20 TTTCCGTCCAGAATTTCTAAATCGTTTCAATGGTGTTATTGAATTCTCTCAC 
CTAAGCAAAGATGACTTAAGCGAAATTGTAGATTTGATGCTTGATGAAGTT 
AACCAAACAATTGGCAAAAAAGGAATTGACCTTGTGGTAGATGAAAATGT 
TAAATCACACTTAATTGAACTGGGTTATGACGAAGCAATGGGAGTACGTC 
CATTGCGCCGTGTCATCGAGCAAGAAATTCGAGATCGCATCACAGACTACT 

25 ATCTCGATCATACAGACGTTAAACACCTAAAAGCTAATTTGCAAGATGGCC 
AAATCGTCATTTCTGAAAGATAA 

KYCIIATSNAGFGNEAFTGDSDKDLKTMER1SPYFRPEFLNRFNGVIEFSHLSKD 
DLSEIVDLMLDEVNQTIGKKGIDLVVDENVKSHLIELGYDEAMGVPvPLRRVIE 
30 QEIRDRITDYYLDHTDVKLHLKANLQDGQ1V1SER* 



Sequence description: 

35 A] Length: 429 bp - 143 aa (partial sequence) 

B] N-terminus yet to be elucidated. This gene 
was not in frame with nuc 



40 

1D-112 



Clone 2-5 
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ATGTCAATGAATTTTTCATTTTTACCACAATATTGGTCCTATTTTAATTATG 
GTGTGATGGTAACCATTATGATTTCAACATGTGTTGTTTTTTTTGGAACTAT 
TATAGGCGTGTTAATTGCTTTAGTAAAGCGTACTAATTTACATTTTCTCACA 
ATATTAGCTAATTTCTATGTATGGGTATTTCGTGGGACACCGATGGTAGTT 
5 CAAATTATGATTGCTTTCGCATGGATGCATTTTAACAATTTACCAACAATT 
AGCTTTGGTGTTTTAGATTTAGATTTTACACGACTTTTACCTGGTATCATTA 
TCATTTCCTTAAATAGTGGTGCCTATATTTCGGAAATTGTACGTGCAGGGA 
TTGAGGCTGTACCATCTGGACAAATAGAAGCAGCTTACTCGTTGGGGATTC 
GACCTAAAAATACACTTCGCTATGTTATCTTACCCCAAGCTTTTAAAAATA 
1 0 TTTTACCTGCTCTAGGG A ATG A ATTTATTACAATTATTAAAGATAGTGCTCT 
CCTTCAAACTATTGGTGTCATGGAATTATGGAACGGAGCACAATCAGTTGT 
AACGGCTACTTACTCACCAGTTGCACCGTTATTATTTGCAGCATTTTACTAT 
TTAATGTTGACAACGATTCTCTCAGCTTTGTTAAAACAAATGGAGAAATAT 
CTTGGGAAAGGGGTAAAAATAGATGGTTGA 

15 

MSMNFSFLPQYWSYFNYGVMVTIMISTCVVFFGT1IGVLIALVKRTNLHFLTIL 
ANFYVWVFRGTPMVVQIMIAFAWMHFNNLPTISFGVLDLDFTRLLPGIIIISLNS 
GAYISEIVRAG1EAVPSGQIEAAYSLGIRPKNTLRYVILPQAFKNILPALGNEFITI 
IKDSALLQT1GVMELWNGAQSVVTATYSPVAPLLFAAFYYLMLTTILSALLKQ 
20 MEKYLGKGVK1DG* 



Sequence description: 

25 

A] Length: 699 bp - 233 aa (full length gene) 

B] Shine-Dalgarno sequence preceded the 'ATG' 
start codon. Possesses a potential leader peptide 
sequence. 

30 



ID-113 
35 Clone 2-7 



ATGAAAGACCTATTACGAAATAGTCTAGAGCAAAGTGGAAATTTAAGTTT 
TCAAGATATGATTTTACATATTCTTGTAGCAGCTTTATTGAGTGTAGTTATT 
40 TATGTTTCCTATGCTTATACGCATAGTGGAACTGCCTATAGTAAAAAGTTT 
AATGTTTCATTAATGACATTGACGGTCTTGACTGCAACAGTAATGACCGTT 
ATTGGTAATAATGTAGCCTTGTCATTGGGTATGGTCGGTGCCTTGTCAGTT 
GTTCGTTTTAGGACAGCCATAAAAGATTCAAGAGATACAGTTTATATTTTT 
TGGACCATAGTTGTTGGTATCTGTTGTGGTGTCGGTGACTATGTGGTAGCT 
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GCATTAGGAAGTAGCGTTATCTTTATCTTATTATGGGTTATGGGACGTGTT 

AAAAACGAGAATCGTATGTTATTGATTGTGAAGTGCGATAGAACACTAGA 

AGTTGATTTAGAAGGAATTTTCTTCCAATATTTTGACGGAAAAGCTGTTCA 

GCGTGTTAAAAATTCAACAACTAATACTATTGAAATGATTTTCGAAATCTC 

TAGAAAAGATTACGATAAGCAACTCCATGTAGATAATCAGTTAACTGAAA 

AAGTGTACCAATTGGGAAATATTGATTATTTCAACATTGTTAGCCAAAGCG 

ACGAAATCAATGGGTAG 

MKLDLLRNSLEQSGNLSFQDMILHTLVAALLSVVrYVSYAYTHSGTAYSKKFNV 
SLMTLTVLTATVMTV1GNNVALSLGMVGALSVVRFRTALKDSRDTVYIFWTLV 
VGICCGVGDYVVAALGSSVrFILLWVMGRVKNENRMLLIVKCDRTLEVDLEGI 
FFQYFDGKAVQRVKNSTTNTTEMTFEISRKDYDKQLHVDNQLTEKVYQLGNID 
YFNIVSQSDEING* 



Sequence description: 



ID-114 
Clone 2-8 



A] Length: 678 bp - 226 aa (full-length gene) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence-Possesses a potential leader 
peptide sequence 



AAAAATTCATTTTAGATTCATTTTACGACTATATACTCAGAAGTACCAAAC 

CTAATCCAAGGTTTGAAAAAAGAAAGAAGGAAGTCAGTATGACAAACTAT 

AAAAACAACTTTAAAGATGAGGCTATACGTGTTGAAGAGACAACAAAAGA 

ATCATTTTACGATGTTGATATTGCCTTGTTTTCAGCTGGTGGATCTATTTCA 

GCAAAGTTCGCTCCTTATGCAGTAAAGTCTGGAGCAGTTGTAGTAGATAAC 

ACGTCATATTTTCGTCAGAATCCTGATGTTCCACTAGTTGTTCCTGAAGTAA 

ATGCTCATGCCATGATTGGTCATAATGGTATCATAGCTTGTCCCAATTGTTC 

TACTATTCAAATGATGATTGCTTTAGAGCCCATTCGTCAAAAATGGGGGAT 

AGAGCGTGTTATAGTTTCCACCTATCAAGCTGTTTCGGGTTCAGGTGCACG 

TGCTGTTGAAGAAACTAAGGAACAGTTGAGACAAGTTTT 

KFILDSFYDYILRSTKPNPRFEKJIKKEVSMTNYKNNFKDEAIRVEETTKESFYD 
VDIALFSAGGSISAKFAPYAVKSGAVVVDNTSYFRQNPDVPLVVPEVNAHAMI 
GHNGIIACPNCSTIQMMIALEPIRQKWGIERVIVSTYQAVSGSGARAVEETKEQ 
LRQV 
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10 



Sequence description: 



ID-115 



Clone 2-9 



A] Length: 499 bp - 165 aa (partial sequence) 

B] N-terminus has yet to be determined 



1 5 ATGAC AAATG AATTGATAATGCAAGCTTTTGAGTGGT ATTTACCTAGTGAT 
GGGAATCACTGGAAGAAATTAGAGGAGTCTATATCAGACCTTAAAAAACT 
TGGAATTAGTAAAATCTGGTTACCACCAGCATTTAAGGGAACTAGCAGTG 
ATGATGTAGGATATGGTGTTTATGATCTCTTTGATTTAGGAGAATTTGACC 
AGAATGGAACAATTAGAACAAAATATGGTAGGAAAGAAGAGTATCTAAA 

20 GCTTATTAAGTCGTTAAAGGCAAATGGCATTAAACCGTTTGCAGATATCGT 
TCTTAACCATAAAGCCAATGGTGATCATAAAGAAAAATTTCAAGTCATCA 
AAGTCAATCCTGAAAATCGTCAAGAAGCATTAAGTGAACCCTATGAGATT 
GAAGGATGGACGGGATTTGATTTCCCAGGTAGACAGGGTGAGTACAATGA 
TTTT 

25 

MTNELIMQAFEWYLPSDGNHWKKLEESISDLKKLGISK1WLPPAFKGTSSDDV 
GYGVYDLFDLGEFDQNGTIRTKYGRKEEYLKLIKSLKANGIKPFADIVLNHKA 
NGDHKEKFQVIKVNPENRQEALSEPYEIEGWTGFDFPGRQGEYNDF 

30 

Sequence description: 

A] Length: 456 bp - 152 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 

35 Dalgarno sequence, no leader peptide sequence. 



ID-116 

40 

Clone 2-10 
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ATGGAGGTTCTTATGAAGAAAGTGTTAGTAAGTAGTCTTTTGGTTTTAGGG 
ATTACGATAACGTTACAACCAGTAGTTGAGGCTAAGGGGCCAAAAGTAGC 
TTATACACAAGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAG 
TCACTACTATTTCTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGC 
5 CGATTACTGTTAGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATA 
TTTTCAATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCAT 
AAACAAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCAT 
TCCCAAAGAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAG 
ATAAAATTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAG 
1 0 GGCG AGGTTG AT AAA ACAGCTA AAGCCTT AGCTAAAG ATTTTAAATTTGT A 
CCATCTGAT 

MEVLMKKVLVSSLLVLGITITLQPVVEAKGPKVAYTQEGMTALSDTNKJDKVT 
TISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQYGKEYVTPGSFDFLHKQKFW 
1 5 DLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGSMYKEGEVDKTA 
KALAKDFKFVPSD 



Sequence description: 

20 

A] Length: 516 bp - 172 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgamo sequence, Possesses a leader peptide 
sequence. 

25 

ID-117 



30 Clone 2-17 



ATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGACAAAGATTAGTCGT 
CGTTTCGTTTGGATGTTAGTGGTTATCTATTGTCTTATTATTGTCAGGATGT 

35 GTTTTGGGCCTCAAATTATGATTGAGGGGGTATCAACTCCGAATGTTCAGC 
GCTTCGGAAGAATTGTAGCTCTTTTAGTACCATTTAATTCTTTTCGTAGTTT 
AGATCAGCTAACTAGCTTTAAAGAGATTCTTTGGGTTATTGGTCAAAATGT 
AGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGTTACTATCCCTAAAG 
CCAAGTTTACGGAAATATAAAAGCGTTATATTACTTGCTTTCTTGATGTCTC 

40 TTTTCATAGAGTGTACTCAAGTTGTTTTAGATATTTTAATAGATGCTAATCG 
GGTTTTTGAAATCGACGATCTATGGACAAATACCTTAGGCGGTCCTTTCGC 
CCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCTAACTATTAGAAAATG 
A 
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MLKRLFTEDGELTKISRRFVWMLVVIYCLI1VRMCFGPQIMIEGVSTPNVQRFG 
R1VALLVPFNSFRSLDQLTSFKEILWVIGQNVVNILLLFPLIIGLLSLKPSLRKYK 
S V ILL AFLMS LF IECTQ V V LD I LID ANR V F E IDDLWTNTLG GPFALWSYRNIKG 
WLLTIRK* 

5 

Sequence description: 

A] Length: 516 bp - 172 aa (full-length gene) 
10 B] ATG start codon is preceded by an Shine- 

Dalgarno sequence. Possesses a potential leader 
peptide sequence. C-terminus need further 
confirmation. 

15 

ID-118 



Clone 3-3 

20 

ATGAAAAAGCTTACTTTTATTTGGGATTTAGATGGGACATTAATAGATTCG 
TATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGCTTA 
ATATTTGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGGG 

25 CAATTATTGGTAAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACT 
GAAAGCATATTTTACAAAAGAACAAGAAAGTCGAGATTCTAAAATACATT 
TAATGCCATATGCAAAAGAGATTTTAGAATGGACCAAAGAACAAGATATT 
CCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATTCAGTGTTGGAA 
ACCTTGCAGATCTCTCATTATTTTGATGAAATTTTAACTGGTGTTTCGGGAT 

3 0 TCG AGCG AA A ACC AC ATCC AC AAGGG ATT AATT ATTT AGTT AA ACG AT ATT 
CTTTAGATAAATCAATGACTTATTACATAGGAGATCGTCCACTAGATTTGG 
AGGTTGCTCAAAATGCTGGTATAAAATCCATAAACTTAAGGTTAGAGAATT 
CCAAAGAAAACTATAATATTTCAAGTCTCAAAGATATAATATCACTTGATT 
TCACTCGTTTGGATTAA 

35 

MKKLTFIWDLDGTLIDSYVP1MEALEETYRHFGLTFDKELIHEYILQESVGQLL 
VNLSEEEQIPHEKLKAYFTKEQESRDSKIHLMPYAECEILEWTKEQDIPNFMYTH 
KGASTHSVLETLQISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYI 
GDRPLDLEVAQNAGIKSINLRLENSKLENYNISSLKDIISLDFTRLD* 

40 

Sequence description: 

A] Length: 627 bp - 209 aa (Possible Full-length gene) 
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B] ATG start codon is preceded by an possible 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 

5 

ID-119 



Clone 3-7 

10 

ATGGAAAAAGAAAAAAAATTAGGTCTTTTACCACTAACAATGCTTGTCATT 

GGCTCTCTTATCGGTGGCGGAATCTTTGATTTAATGCAAAATATGAGTTCC 

AGAGCCGGTTTGGTACCAATGCTTATTGCTTGGGTAATTACTGCTATCGGG 

1 5 ATGGGAACTTTCGTTTTAAGTTTTCAAAATTTATCTGAA AAAAGGCCGGAC 
CTAACAGCTGGAATCTTTAGTTACGCTAAAGAGGGGTTTGGAAACTTTATG 
GGATTTAACTCTGCATGGGGTTATTGGTTATCAGCTTGGCTTGGAAATGTT 
GCCTACGCTGCACTCTTATTCAGTTCACTCGGTTATTTCTTTAAATTCTTTG 
GTAATGGAAATAATATCATCTCAATTATTGGAGCAAGTATAGTTATTTGGG 

20 TTGTCCATTTCTTAATTTTAAGAGGTGTTAATACAGCTGCATTTATTAATAC 
CGTAGTTACCTTTGCAAAATTAGTACCTGTTATTATTTTCTTAATTTCAGCG 
TTATTAGCTTTCAAATTTAACATTTTTAGTCTTGATATCTGGGGAAATGGAT 
TACATCAATCAATTTTCAACCAAGTCAATTCAACTATGAAAACCGCTGTTT 
GGGTATTTATTGGTATTGAGGGCGCCGTTGTCTTCTCAGGTCGTGCTAAAA 

25 AACACTCTGATATTGGTAAAGCAAGTATCCTAGCATTATTCACTATGATTT 
CACTTTATGTATTGATTTCTGTTTTATCACTTGGTATCATGTCACGTCCAGA 
ACTTGCAAACTTAAAAACACCAGCTATGGCTTACGTTCTAGAAAAAGCTGT 
TGGTCACTGGGGTGCTATCTTAGTTAACCTTGGTGTTATCATTTCAGTATTT 
GGCGCTATTCTTGCTTGGACTTTATTTGCAGCAGAATTACCATATCAAGCT 

30 GCTAAAGAAGGTGCTTTTCCTAAATTTTTTGCAAAAGAAAATAAAAACAA 
AGCTCCAATCAACTCACTCTTAGTCACTAATCTTTGTGTACAAGCATTCTTA 
ATCACGTTCTTATTCACACAAAGTGCTTATCGTTTTGGTTTCGCATTAGCAT 
CATCTGCTATCTTAATTCCTTATGCTTTTACAGCACTATATCAATTACAATT 
CACACTCCGTGAGGATAAGTCAACTCCAGGACATCAAAAGAATTTAATTA 

3 5 TCGGTATCCTCGCT AC AATCTATGCTGTTT ACCTT ATCTACGCTGGTGGTTT 
TGATTACTTACTTTTGACAATGATTGCTTATACTCTAGGTATGATTCTCTAT 
ATTAAAATGAGAAAAGATGACAAGCTTGGCGTAATCATGGTCATAGCTGT 
TTCCAGTGTGAAATTGTTATCC 

40 

MEKEKKLGLLPLTMLVIGSLIGGGTFDLMQNMSSRAGLVPMLTAWVITATGMG 
TFVLSFQNLSEKRPDLTAGIFSYAKEGFGNFMGFNSAWGYWLSAWLGNVAY 
AALLFSSLGYFFKFFGNGNNIISIIGASIVIWVVHFLTLRGVNTAAFINTVVTFAK 
LVPV1IFLISALLAFKFN1FSLDTWGNGLHQSIFNQVNSTMKTAVWVFIGIEGAV 
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VFSGRAKKHSDIGKASILALFTMISLYVLISVLSLGIMSRPELANLKTPAMAYV 

LEKAVGHWGAILVNLGVIISVFGAILAWTLFAAELPYQAAKEGAFPKFFAKEN 

KNKAPINSLLVTNLCVQAFLTTFLFTQSAYRFGFALASSAILTPYAFTALYQLQF 

TLREDKSTPGHQKNL1IGILATIYAVYLIYAGGFDYLLLTMIAYTLGMILYIKMR 

KLDDKLGVIMVIAVSSVKLLS 



Sequence description: 



A] Length: 1356 bp - 452 aa (partial sequence) 

B] ATG start codon is preceded by an possible 
Shine-Dalgarno sequence. Possesses a potential 
leader peptide sequence. 



ID- 120 



Clone 3-8 



ATGAAATTTGAAAAACGGCAGGTCTATTATGTTGTCATAACATTTGCTATT 

TGCTATGCTATACAGGCTTATTGGGGAGCTGTTTCTAATATTTTAACTACGC 

TTCATAAGGCAATATTTCCTTTTTTGATGGGAGCTGGAATTGCCTATATTAT 

TAATATTGTAATGTCAGTCTATGAGCGATTATATATAAAGCTTTTTAAAGG 

ATCTAGACTATTAATGGCAATCAAGCGTAGTGTTTCTATGATTTTATCCTAT 

GCAACTTTTATTGGTTTAATTGTCTGGCTATTTTCAATTGTCATTCCAGATT 

TGATTTCTAGTTTGAGTTCTTTATTGGTTATTGATACCGGAGCACTTGCTAA 

ATTGGTTAATAATCTCAATGAAAATAAACAAATTTCTGAGGCTTTAAATTA 

TATGGGAACAGATAAAGACTTAGTTTCTACTTTAAGTGGTTATAGCCAGCA 

GATTTTGAAGCAAGTTTTATCTGTTTTAACAAATTTACTAACCTCAGTTTCC 

TCTATTGCGGCAACACTTCTGAATGTTTTTGTTAGTTTTATTTTTTCAATTTA 

CGTTTTGGCAAACAAGGAGCAGTTGGGACGTCAATTTAATTTGTTAATTGA 

TACCTATTTAGGTTCAACAGGCAAAACATTCCATTACGTTCGTCATATCCTT 

CATCAACGTTTCCATGGTTTTTTTGTAAGCCAAACTTTAGAAGCTATGATTT 

TAGGAAGTTTGACGGTTATTGGTATGTTGATCTTCCAATTTCCTTATGCTTT 

AACAGTTGGGGTTTTAGTTGCTTTTACAGCTCTAATACCGGTTGTGGGAGC 

CTACATTGGTGTTACAATCGGTTTCATCTTAATTGCTACTGAATCGCTTACT 

GAAGCATTCTTGTTTGTTCTTTTCTTGATCCTTTTACAACAATTTGAGGGAA 

ATGTCATTTATCCGAAAGTTGTCGGTGGATCGATTGGACTGCCTTCTATGT 

GGGTTTTAATGGCTATTACTATCGGAGGTGCTTTATGGGGGATCTTAGGCA 
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TGTTACTTGCTGTTCCTGTTGCAGCTACTATCTATCAGATTGTAAAAGATCA 
TATTATCAAGCGACAAACGCTTAGAAATCGTGCACGAACCTATCGTTAA 

MKFEKRQVYYVVITFAICYAIQAYWGAVSNILTTLHKAIFPFLMGAGIAYIINI 

VMSVYERLYIKLFKGSRLLMAIICRSVSMILSYATFIGLIVWLFS1VIPDLISSLSS 

LLVIDTGALAKLVNNLNENKQ1SEALNYMGTDK.DLVSTLSGYSQQJLK.QVLSV 

LTNLLTSVSSIAATLLNVFVSFlFSlYVLANtCEQLGRQFNLLLDTYLGSTGKTFH 

YVRHILHQRFHGFFVSQTLEAMILGSLTVIGMLJFQFPYALTVGVLVAFTALIP 

VVGAYIGVTIGFILIATESLTEAFLFVLFLILLQQFEGNVIYPKVVGGSIGLPSM 

WVLMA1TIGGALWGILGMLLAVPVAATIYQIVKDHIIKRQTLRNRARTYR* 



Sequence description: 

A] Length: 1 134 bp - 378 aa (full-length gene) 

B] ATG start codon is preceded by an typical 
Shine-Dalgamo sequence. Possesses a potential 
leader peptide sequence. 



ID-121 

Identical to ID-68, as described in WO 00/06736 



ID- 122 



Clone 3-16 



GTGATTACAATTAAAAAGGAATCTGTTATCAAACTATTGAAGTATGCTTTT 

GGCATTATAATGGGATTTATTATCTTAGCTATTGTAATAGGTGGGCTCCTA 

TTTGCATACTACGTTAGTCGTTCTCCGAAATTAACCGATCAAGCTTTAAAA 

TCCGTTAACTCTAGTTTGGTTTATGATGGTAATAATAAACTTATTGCCGATT 

TAGGCTCAGAAAAGCGTGAAAGTGTTAGTGCGGATAGCATTCCACTAAAT 

TTGGTTAACGCTATCACTTCTATAGAAGATAAACGTTTCTTTAAACATAGA 

GGTGTCGATATTTATCGTATTTTAGGTGCAGCTTGGCATAACCTTGTTAGTA 

GTAATACGCAAGGTGGTTCAACCCTTGATCAACAGTTGATTAAACTGGCTT 

ACTTTTCTACCAATAAATCTGACCAAACGTTAAAACGTAAATCACAGGAA 

GTTTGGCTTGCGCTTCAAATGGAGCGTAAATACACCAAAGAAGAAATTCTT 

ACTTTCTATATTAATAAAGTTTATATGGGAAATGGGAATTATGGTATGAGA 
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ACAACAGCTAAATCATACTTTGGTAAAGACCTAAAGGAATTATCTATTGCA 
CAACTTGCTTTGCTCGCTGGTATTCCTCAAGCACCTACACAATATGACCCTT 
ATAAAAACCCAGAATCTGCTCAAACAAGACGTAATACCGTTCTTCAGCAG 
ATGTATCAAGATAAAAACATTTCTAAAAAGGAATACGACCAAGCTGTTGC 
5 AACTCCAGTAACTGATGGCTTAAAAGAATTAAAGCAAAAATCTACTTATCC 
AAAATATATGGATAACTACTTAAAACAAGTTATTAGTGAAGTTAAACAAA 
AAACTGGTAAAGATATCTTTACTGCTGGGCTAAAAGTGTATACTAATATCA 
ACACTGATGCACAAAAACAACTATATGACATCTACAACAGTGATACTTAC 
ATCGCTTATCCAAACAATGAATTACAAATAGCATCTACCATCATGGATGCG 

1 0 ACTAATGGTAAAGTCATTGCACAATTAGGCGGGCGTCATCAGAATGAAAA 
TATTTCATTTGGGACAAATCAATCTGTCTTAACAGACCGCGATTGGGGTTC 
TACAATGAAACCTATCTCAGCTTATGCACCTGCTATTGATAGTGGTGTCTA 
TAATTCAACAGGTCAATCATTAAACGACTCAGTTTACTACTGGCCTGGTAC 
TTCTACTCAACTATATGACTGGGATCGTCAATATATGGGTTGGATGAGTAT 

1 5 GC AGACCGCTATTCAACAATCACGTAACGTCCCTGCTGTC AGAGC ACTTGA 
AGCCGCTGGATTAGACGAAGCAAAATCTTTCCTTGAAAAATTAGGCATAT 
ACTATCCAGAAATG 

MITIKKESVIKLLKYAFGUMGFIILAIVIGGLLFAYYVSRSPKLTDQALKSVNSS 
20 LVYDGNNKLIADLGSEICRESVSADSIPLNLVNAITSIEDKRFFKHRGVDIYRILG 
AAWHNLVSSNTQGGSTLDQQLIKLAYFSTNKSDQTLKRKSQEVWLALQMER 
KYTfCEEILTFYlNKVYMGNGNYGMRTTAKSYFGKDLKLELSIAQLALLAGIPQA 
PTQYDPYKNPESAQTRRNTVLQQMYQDKNISKKEYDQAVATPVTDGLKELK 
QKSTYPKYMDNYLKQVISEVKQKTGKDIFTAGLKVYTNLNTDAQKQLYDIYN 
25 SDTYIAYPNNELQIASTIMDATNGKVIAQLGGRHQNENISFGTNQSVLTDRDW 
GSTMKLPISAYAPAIDSGVYNSTGQSLNDSVYYWPGTSTQLYDWDRQYMGWM 
SMQTAIQQSRNVPAVRALEAAGLDEAKSFLEKLGIYYPEM 



30 Sequence description: 

A] Length: 1386 bp - 462 aa (partial sequence) 

B] GTG start codon is preceded by an 
typical Shine-Dalgarno sequence. Possesses a 

35 potential leader peptide sequence. 



ID- 123 

40 



Clone 3-17 
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ATGGCTAATGTATATGATTTAGCAAATGAATTAGAACGTGCTGTTCGTGCT 

TTACCAGAATACCAAGCAGTTTTAACTGCAAAAGCAGCTATTGAAAATGA 

TGCGGATGCACAAGTGCTTTGGCAAGACTTTTTGGCTACCCAATCAAAAGT 

TCAAGAAATGATGCAATCTGGCCAAATGCCAAGTCAAGAAGAACAAGATG 

AAATGTCTAAACTTGGGGAAAAAATTGAATCCAATGACCTTTTAAAAGTTT 

ATTTTGACCAACAACAACGGTTGTCTGTCTATATGTCTGATATCGAAAAAA 

TTGTCTTTGCACCCATGCAGGACTTGATGTAA 

MANVYDLANELERAVRALPEYQAVLTAKAAIENDADAQVLWQDFLATQSK 

VQEMMQSGQMPSQEEQDEMSKLGEK.IESNDLLKVYFDQQQRLSVYMSDIEKI 

VFAPMQDLM* 



Sequence description: 

A] Length: 336 bp - 1 12 aa (full length sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 



ID- 124 



Clone 3-26 



ATGGCAGAAATCACAGCTAAACTTGTAAAAGAATTGCGTGAAAAATCAGG 

TGCAGGCGTTATGGACGCTAAAAAAGCATTAGTAGAAACTGATGGTGACC 

TTGATAAAGCGATTGAATTACTTCGCGAAAAAGGTATGGCTAAAGCAGCT 

AAAAAAGCAGACCGTGTTGCTGCTGAAGGTTTAACAGGTGTTTATGTTGAT 

GGTAACGTTGCAGCAGTTATTGAAGTTAA 

MAE1TAKLVFCELREKSGAGVMDAKKALVETDGDLDKAIELLREKGMAKAAK 
KADRVAAEGLTGVYVDGNVAAVIEV 



Sequence description: 

A] Length: 230 bp - 76 aa (partial sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 
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ID-125 

5 

Clone 3-33 



1 0 ATGATAA AAAACCTGTTATTAAC AGGTTTTTT ATC ATTTAATG ACGGAAAA 
CTGGACACAAATTATTTTTCTTGTATAATTAAATATATTATTTCTTATCAGG 
AGGTTATGATGACATTAGAGAAACGATTTAA 

MDCNLLLTGFLSFNDGKLDTNYFSCIIKYIISYQEVMMTLEKRF 

15 

Sequence description: 

A] Length: 134 bp - 44 aa (partial sequence) 
20 B] ATG start codon is preceded by an 

typical Shine-Dalgarno sequence. Possible 
potential leader peptide sequence. 

25 

ID- 126 



Clone 3-41 

30 

ATGAAAAATAATAAAAATAATGGTTTTCTGAAAAATTCCTTTATTTACATA 
TTATTGATTATTGCGGTTATTACAACCTTTCAATACTATTTAA 

35 MKNNKNNGFLKNSFIYILLIIAVITTFQYYL 



Sequence description: 

40 A] Length: 94 bp - 31 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 



-Ji si", ij ';. s"ft ~7i ^I. -"-;? ="51 — T= if "ft g~t '/-V 
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ID- 127 

5 

Clone 3-42 



ATGTTAGATATTATCTTATCCGGAATTTCGCAAGGATTACTTTGGTCAATTA 
1 0 TGGCAATTGGCGTGTTTATCACTTTTCGT ATCTT AG ACAT AGCCGATCTCTC 
TGCAGAAGGGGCTTTCCCTATGGGGGCTGCAGTTTGCGCCTTATGTATCGT 
TAA 

MLDLILSGISQGLLWSIMAIGVF1TFRILDIADLSAEGAFPMGAAVCALCIV 

15 

Sequence description: 

A] Length: 158 bp - 52 aa (partial sequence) 
20 B] ATG start codon is preceded by a 

possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 

25 

ID- 128 



Clone 3-43 

30 

ATGGAAATGCCTAAAAGAAATGAATTAC TCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 
35 GTTATTTAA 

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLF 



40 Sequence description: 

A] Length: 161 bp - 53 aa (full-length gene) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 



Uv.1' .,{'' >ull„ Mm" M.i" ?!I «-..P «.itt!' fl iif 1 *W» "iTtKH 
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leader peptide sequence. 

ID- 129 
Clone 3-44 



GTGGTAAGTAAATTGAGTTTAACAACGATTTTTGCATTGCTATTTTCATCA 
ATGCTAATTTACGCAACACCTCTTATCTTTACAAGTATTGGGGGAACCTTC 
TCTGAACGTGGTGGTATCGTCAACGTTGGTTTAGAAGGAATTATGGTAATT 
GGAGCTTTCTCAGGCGTTGTATTTAA 

MVSKLSLTTIFALLFSSMLIYATPLIFTSIGGTFSERGGIVNVGLEG1MVIGAFSG 
VVF 



Sequence description: 



A] Length: 1 79 bp - 59 aa (partial sequence) 

B] GTG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 



ID- 130 



Clone 3-46/47 



ATGAGAATTATTGCAATAACTGAAAAGGTTATAAAAGAACTGTTTCGTGAT 

AAAAGAACACTTGCTATGATGTTTTTAGCACCTATTTTAATTATGTTTTTGA 

TGAATGTTATGTTTTCTGCGAATAGTAATACAAAAGTTAAGATTGGAACTA 

TTAACGTTAACACGAAGGTCGTTTCAAATTTAGATAATATTAAGCATATTC 

AAGTGAGATCATTTAAATTTAACTCATCTGCTAAAAAAGCACTCAAATCAA 

ATAAAATTGATGCTCTTATTTCGGAGGACAATAAATCTTATACTGTCTTCT 

ATGCGAATACAGATTCTTCAAAGACGACTTTAACAAGACAAGCTTTTAAA 

ACCGCTGTTAATACAATGAACAGTAAGGAACTGATTTCGCAAGTTAAAATT 

TTAGCTAATAAGAATCCGAAACTAGCACAATCCTTACAAACTCGCTCCAAA 

TATATCAAAGAAAAATATAATTACGGAAATAAAAATACAGGCTTTTTTGC 

AAAAATGATACCAATACTAATGGGATTTATGGTCTTCTTCTTGGTTTTT 



*:! *S~h H^li '-3 fT:: JPfc ~V " jf£ tT^S: r~^ : 
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MRIIAITEKV1KELFRDKRTLAMMFLAPILIMFLMNVMFSANSNTKVKIGT1NV 
NTKVVSNLDNIKHIQVRSFKFNSSAKKALKSNK1DALISEDNKSYTVFYANTDS 
SKTTLTRQAFKTAVNTMNSKLELISQVKILANKNPKLAQSLQTRSKYIKEKYNY 
5 GNKNTGFFAKMIPILMGFMVFFLVF 



Sequence description: 

10 A] Length: 558 bp - 1 86 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. C-terminus has yet to be 
determined. 



ID-131 

20 

Clone 3-48 



GTGATTATCGTTATGAGTAAACATCAAGAAATTTTGGAGTACCTAGAAAAT 
25 TTAGCTGTTGGTAAGAGGGTTAGTGTACGCAGTATTTCAAATCATTTAA 

MIIVMSKHQEILEYLENLAVGKRVSVRSISNHL 



30 Sequence description: 

A] Length: 100 bp - 33 aa (partial sequence) 

B] GTG start codon is not preceded by a 
obvious Shine-Dalgarno sequence. No obvious 

35 leader peptide sequence. 



ID- 132 

40 



Clone 2-c53 
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ATGTATAGAGAAATTACCGCTGTCGAACACGATCGCTTTGTGAGCGAATCC 

AACCAAACAAACCTACTTCAATCTCTTAATTGGCCCAAAGTAAAAGACAA 

CTGGGGTAGTCAATTACTTGGCTTTTTTGACGGTGAAACCCAAATTGCCAG 

CGCTAGTATTCTCATCAAATCACTTCCTCTTGGCTTCTCCATGCTGTATATT 

CCGCGTGGACCAATCATGGATTACTCCAATCTAGATATTGTAACTAAGGTC 

CTTAAGGACCTTAAAGCTTTTGGCAAAAAACAAAGAGCTCTCTTTATCAAG 

TGTGATCCTCTCATCTATTT 

MYREITAVEHDRFVSESNQTNLLQSLNWPKVKDNWGSQLLGFFDGETQIASA 

SlLIKSLPLGFSMLYrPRGPIMDYSNLDIVTKVLKDLKAFGICKQRALFJKCDPLI 

Y 



Sequence description: 



A] Length: 326 bp - 108 aa (partial sequence) 

B] ATG start codon is preceded by an obvious 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 



ID-133 



Clone 2-c59 



ATGGACAAGAAAAAAATCTTAGTAACGGGTATTGTGCCTAAAGAAGGTCT 

AAGAAAGCTTATGGACCGATTTGATGTTACTTATTCAGAAGATCGCCCATT 

TTCACGTGACTATGTGTTAGAGCATTTATCTGAATATGACGGATGGTTACT 

CATGGGACAAAAAGGTGATAAAGAGATGATTGATGCAGGTGAAAACTTAC 

AAATTATTTCTTT 

MDKKKILVTGIVPKEGLRKLMDRFDVTYSEDRPFSRDYVLEHLSEYDGWLLM 
GQKGDKEMIDAGENLQIIS 



Sequence description: 



A] Length: 215 bp - 71 aa (partial sequence) 
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B] ATG start codon is preceded by an obvious 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 



ID-134 



10 Clone 2-c62 



ATTTCGAAAGATGACTACCAAAATATTAGTTTTGGACAGGATCCAGAAGTT 
GTTGATTATGCTGGTCTGTTTGAAAAACGCCGTCCAGTTTTAGAAAAAGCA 

1 5 GTTAAAAATTTCTTGCAAGAAGAGAGAGCTACGAGAATGCTATCTGATTTC 
TTGCAAGAAGAAAAATGGGTAACTGATTTTGCTGAATTTATGGCGATCAA 
AGAACATTTTGGTAATAAGGCGCTTCAAGAATGGGATGACAAGGCTATTA 
TACGCCGCGAAGAAGAAGCCTTAGCAGGATATCGTCAAAAGCTTAGTGAA 
GTGATAAAATATCATGAAGTAACGCAATATTTCTTTTACAAACAATGGTTT 

20 GAGTTAAAAGAATATGCTAATGATAAAGGGATTCAAATTATCGGTGATAT 
GCCAATCTACGTTTCTGCCGATAGTGTAGAAGTTTGGACAATGCCTGAACT 
GTTT 

ISKDDYQNISFGQDPEVVDYAGLFEICRRPVLEKAVKNFLQEERATRMLSDFLQ 
25 EEKWVTDFAEFMA1KEHFGNKALQEWDDKAIIR.REEEALAGYRQKLSEVIKY 
HEVTQYFFYKQWFELKEYANDKGIQIIGDMPIYVSADSVEVWTMPELF 



A] Length: 459 bp - 153 aa (partial sequence) 
30 B] More sequencing is required to determine the 

N- and C-termini 

enzyme). - Streptococcus pneumoniae (63%) 



40 



ID- 135 

Identical to ID-108 described in WO 00/06736 

Clone 2-c63 
ID- 136 



Clone 2-c66 
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ATGGCAAAACAGAAAAATAACTGGCGCCGTGTTGGAGTTGGTGTCCTTAC 
ACTTGCTTCAGTTGCGACTCTTGCTGCATGTGGAAGTAAATCAGCTTCCCA 
5 GGATTCTAATGGAGCGATTAATTGGGCTATTCCAACAGAAATCAATACACT 
AGATTTATCTAAAGTTACAGACACTTACTCAAATCTAGCTATTGGTAACTC 
TAGTAGTAATTTCCTTCGCTTAGATAAAGATGGAAAGACAAGACCAGACTT 
GGCTACTAAAGTTGATGTTTCAAAAGATGGCTTAACTTATACAGCTACATT 
ACGTAAAGGCTTGAAGTGGTCAGATGGCAGTAAACTTACTGCAAAGGATT 

1 0 TTGTTTATTCATGGC AACGTTT AGTTGATCCTAA AAC AGCTTC AC A ATATG 
CTTACCTTGCTGTTGAAGGGCATGTGCTTAATGCCGATAAAATCAACGAAG 
GACAAGAGAAAGACTTGAATAAGCTAGGTGTTAAGGCAGAAGGCGATGA 
CAAAGTTGTTATTACTTTATCTAGTCCGTCTCCGCAATTCATCTACTACCTT 
GCATTCACTAACTTCATGCCACAAAAACAAGAAGTTGTTGAAAAATATGG 

1 5 AA AAGATTACGC AACTACTTC AAAA AATAC AGTTT ACTC AGGACCATATA 
CTGTTGAAGGTTGGAATGGTTCGAATGGTACTTTCACGCTGAAGAAAAAC 
AAAAATTATTGGGACGCTAAAAATGTAAAAACAAAAGAAGTTCGCATCCA 
GACTGTTAAAAAACCAGATACCGCCGTTCAAATGTATAAACGTGGTGAGT 
TAGATGCAGCTAATATCTCAAATACTTCTGCTATTTATCAAGCTAATAAAA 

20 ATAATAAAGATGTCACAGATGTTCTAGAAGCGACCACTGCCTATATGGAA 
TATAATACTACTGGTTCTGTGAAAGGGCTTGATAATGTTAAGATTCGTCGC 
GCCTTAAACTTAGCAACTAACCGTAAAGGAGTTGTTCAAGCAGCCGTTGAT 
ACAGGCTCAAAACCGGCAATTGCTTTTGCACCTACTGGTTTAGCCAAAACA 
CCAGATGGAACTGATTTGGCAAAATATGTTGCCCCAGGTTATGAATATAAT 

2 5 A A A ACTG A AGC AGC A A AACTCTTT AG ACT A 

MAKQKNNWRRVGVGVLTLASVATLAACGSKSASQDSNGAl>JWAIPTErNTLD 
LSKVTDTYSNLAIGNSSSNFLRLDKDGKTRPDLATKVDVSKDGLTYTATLRKG 
LKWSDGSKLTAKDFVYSWQRLVDPKTASQYAYLAVEGHVLNADKINEGQEK 
30 DLNKLGVKAEGDDKVVITLSSPSPQFIYYLAFTNFMPQKQEVVEKYGKDYAT 
TSKNTVYSGPYTVEGWNGSNGTFTLKKNKNYWDAKNVKTKEVR1QTVKKPD 
TAVQMYK^GELDAANISNTSAIYQANKJVTNKDVTDVLEATTAYMEYNTTGSV 
KGLDNVKIRRALNLATNRKGVVQAAVDTGSKPAIAFAPTGLAKTPDGTDLAK 
YVAPGYEYNKTEAAKLFRL 

35 

Sequence description: 



40 



A] Length: 1143 bp - 381 aa (partial sequence) 

B] Shine-Dalgarno sequence precedes ATG codon. 
Possesses a potential leader peptide sequence. 
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ID-137 



Clone 2-c67 

5 

TTGAGAGTTTATGAAAATAAAGAAGAGTTGAAAAAAGAAATAAGTAAAAC 
ATTTGAGAAATACATTATGGAATTTAATAA 

TATTCCAGAGAATCTAAAAGATAAAAGAATTGATGAAGTTGATAGAACTC 
1 0 C AGC AGAAAACCTTTCTTATC AGGTTGGCT 

GGACCAACTTGGTTCTTAAATGGGAAGAAGATGAAAGAAAGGGACTTCAA 
GTAAAAACACCATCGGATAAATTT 

MRVYENKEELKKEISKTFEKYIMEFNNIPENLKDKR1DEVDRTPAENLSYQVG 
15 WTNLVLKWEEDERKGLQVKTPSDKF 



Sequence description 

20 A] Length: 234 bp - 78 aa (partial sequence) 

B] TTG start codon is preceded by a 
potential Shine-Dalgamo sequence. No obvious 
leader peptide sequence. 

25 

ID- 138 



30 Clone 2-c70 



ATGTCAAAGTTTGATAGTCAGAAAATAATTACTCCGATTATGAAGTTTGTC 
AATATGCGAGGGATTATTGCACTCAAAGATGGCATGCTAGCAATTTTACCA 
3 5 CTAAC AGTTGTTGGG AGTCTCTTTTTAAT ATTAGGGC AGCTTCC ATTT 

MSlCFDSQKIITPIMKFVNMRGnALKDGMLAILPLTVVGSLFLlLGQLPF 



40 Sequence description 

A] Length: 150 bp - 50 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
Shine-Dalgarno sequence. Possesses a potential 



5 



30 



35 
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leader peptide sequence. 

ID- 139 
Clone 2-c71 



1 0 g ag acc acttc atc agtta aaccagcagg aattg accgtatcaatc at acc 
tcaacacccccgaagaaaactacccccaacattgcaacgacgcatagctt 
caaagatcgttgtgatactttagaaagaattcacaatgaagacattgatgt 
ttgttctggattcatttgtggtatgggagagagcgatgaggggctcatcac 
attagctttcagactaaaagaactgaacccctattctatccctgtcaatttt 

1 5 ttacttgctgttgaaggaacacctcttggaaaatataactatttgactccc 
attaaatgcttaaaaattatggccatgttgcgttttgtttttcctttcaagg 
aattaagattaagtgctggacgggaggtccattttgagaattttgaatcat 
tagtcaccttacttgttgactcaacttttttgggaaattacctaacagagg 
ggggtcgcaatcaacataccgatattgaattcttggaaaaattacaacta 

20 aatcatactaaaaaggaattaattt 

ettssvkpagtdritmhtstppkkttpniatthsfk:drcdtlerihnedidvcsgfi 
cgmgesdeglitlafrlkelnpysipvnfllavegtplgkynyltpikclkima 
mlrfvfpfkelrlsagrevhfenfeslvtllvdstflgnylteggrnqhtdief 
25 leklqlnhtkk.eli 



Sequence description: 



A] Length: 535 bp - 178 aa (partial sequence) 

B] N- and C-termini require verification 



ID- 140 



Clone 2-c73 



40 



ATGCCGGTTTGGACTGCACAGTCTATTCCAAAGGCATTTTTAGAAAAGCAT 
AATACTAAGGAAGGCACCTGGGCAAAACTAACCATTCTAAGTGGTTCTTTA 
GTATTTTACCAGTTATCTCCTGATGGAGAGGAAATCTCGCGGCATATTTTT 
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GATGCTAGTAGTGATATTCCTTTTGTTGATCCACAAGTCTGGCATAAAGTT 
TCGCCGAATAGTCCAGACTTAAGTTGCTATCTAACTTTTTACTGCCAAAAA 
GAAGATTACTTCCATAAAAAATATGGTCTCACGCGCACACATTCTGAGGTT 
ATCGCCAGTGCACCTCTCTTATCTGAGAAGAGTAATATATTAGACCTTGGG 
5 TGTGGTCAAGGGCGAAACTCACTTTATTTATCGCTGCTGGGACATCAAGTG 
ACTTCTGTCGATTCAAACGGACAGAGCCTTGTAGCTTTAGAAAATATGGCA 
TTAGAAGAAGAGCTTCCTTACAATATAAAAAGGTATGATATTAATACTACT 
GCTATTGAAGGGCACTATGATTTTATTTTATCAACTGTGGTATTTATGTTTT 
T 

10 

MPVWTAQSIPKAFLEKHNTKEGTWAKLTILSGSLVFYQLSPDGEEISRH1FDAS 
SDEPFVDPQVWHKVSPNSPDLSCYLTFYCQFCEDYFHKKYGLTRTHSEVIASAP 
LLSEKSNILDLGCGQGRNSLYLSLLGHQVTSVDSNGQSLVALENMALEEELPY 
NIKRYDINTTAIEGHYDFILSTVVFMF 

15 

Sequence description: 



20 A] Length: 563 bp - 187 aa (partial sequence) 

B] N- and C-termini require verification 



25 ID-141 



Clone 2c76 

30 

ATGACAAAGCAAATAATTGCCATTTGGGCTGAAGATGAAGACCATTTGAT 
TGGAGTTAATGGCGGTTTACCATGGAGGCTTCCTAAAGAGTTACATCACTT 
CAAAGAAACGACCATGGGGCAGGCTTTGCTTATGGGACGAAAGACCTTTG 
ATGGAATGAACCGTCGTGTTTTACCTGGTAGAGAGACAATCATCTTAACAA 
3 5 AAGATG A AC AATTCC AAGC AG ATGG AGTG AC AGTCCTAAAT AGTGTTGAA 
CAAGTTATAAAATGGTTTCAGGAACATAATAAGACCTTATTTATTGTAGGT 
GGTGCAAGTATTTATAAAGCATTTCTGCCTTATTGTGAAGCAATCATAAAA 
ACTAAAGTTCATGGAAAATTCAAAGGTGATACCTATTTTCCTGATGTTAAT 
CTATCTGAGTTT 

40 

MTKQIIAIWAEDEDHLIGVNGGLPWRLPKELHHFKETTMGQALLMGRKTFDG 

MNRRVLPGRETIILTKDEQFQADGVTVLNSVEQVIKWFQEHNKTLFIVGGASI 

YKAFLPYCEAIIKTKVHGKFKGDTYFPDVNLSEF 
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Sequence description: 



A] Length: 417 bp - 139 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence. No leader peptide sequence 



10 ID-142 

Clone 2-c78 

15 

TTGTGGCCAAACTGTGCCCCGCTTATTAATAGCACTTTGTTCACCATTGAA 
GATATCTTAACATCAGGTGCTCATAGCAACCCTATTTTAATGGGGGTTATA 
CTTGGCGGGACAATTGTAGTAGTGGCGACAGCACCACTTTCTTCTATGGCA 
TTGACAGCTATGCTAGGATTAACCGGAATGCCTATGGCTATAGGAGCCTTG 

20 TCTGTCTTTGGTTCGTCATTTATGAATGGTGTACTTTTCCATAAATTAAAAC 
TTGGAAGTCGTAAAGATAATATAGCTTTTGCTGTTGAGCCTCTAACTCAAG 
CTGACGTGACTTCAGCTAACCCTATTCCAATCTATGTCACTAATTTTGTTGG 
TGGTGCAGCTTGTGGTATTTTAATTGCCTTGATGAAATTAGTTAATGATACT 
CCTGGAACAGCGACACCAATTGCAGGATTTGCTGTCATGTTTGCCTATAAC 

25 CCAATGATAAAAGTACTAATAACCGCTCTAGGTTGTATTATCCTATCTTTA 
CTAGCAGGCTATTTTGGAGGCATTGTTTTT 

MWPNCAPLINSTLFTIEDILTSGAHSNP1LMGVILGGTIVVVATAPLSSMALTA 
MLGLTGMPMAIGALSVFGSSFMNGVLFHKLKLGSRKDNIAFAVEPLTQADVT 
30 SANPIPIYVTNFVGGAACGILIALMKLVNDTPGTATPIAGFAVMFAYNPMIKVL 
1TALGCIILSLLAGYFGGIVF 



Sequence description: 

35 



A] Length: 540 bp - 180 aa (partial sequence) 

B] N- and C-termini have yet to be elucidated 



40 



ID- 143 
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ID- 145 

5 

Clone 3-86 



ATGTCATATTTTAGAAATTACTGGTATCGTTTTGGAGCAATTTTATTTATTA 
1 0 TTTTAGCAGTAATATTGCTTGTTTTTAGACCTGACTGGTCAATGCTTCACTA 
TCTATTGTATTTTTACTTTATGGCACTTCTAGCGCATCAATTTGAAGAATAT 
CAGTTTCCCGGTGGGGCATCACCTATCATTAACTATGTTGTTTATGATGAA 
GAAGAGCTGATGGATTGTTTTCCAGGCAATACTCAGTCTATTATGTTGGTT 
AATACTATTGCTTGGTTGCTTTACATTGCTAGTATTGCTTTTCCTCAAGCTT 
1 5 ATTGGCTTGGATTAGGAGTC ATGTTCTTTAGTCTAACGCAGCTCTTGGGTC 
ATGGTTTTCAGATGAATATTAAACTTAAAACTTGGTATAATCCTGGTCTAG 
CAACGACAGTATTTCTCCTAGTACCAATAGCTTGCGCATACATCTATCAAG 
CTAGTGCAGAAGGAATGCTCACTTGGGGAGATTGGCTAGGTGGTTTTATCA 
TGTTGATTGTCTGTGTACTAACTAGCATTATTGCACCTGTACAGCTATTGAA 
20 GGATAAGGAGACCAATTATATTATTAGTCCTTGGCAAATGGACCGTTTTCA 
TAAGGTCGTTAATTTTGTAAGGATAAAAAAATAA 

MSYFRNYWYRFGAILFIILAVILLVFRPDWSMLHYLLYFYFMALLAHQFEEYQ 
FPGGASPIINYVVYDEEELMDCFPGNTQSIMLVNTIAWLLYIASIAFPQAYWLG 
25 LGVMFFSLTQLLGHGFQMNIKXKTWYNPGLATTVFLLVPIACAYIYQASAEG 
MLTWGDWLGGFIMLIVCVLTSIIAPVQLLKDKLETNYIISPWQMDRFHKVVNFV 
ROCK* 



30 Sequence description: 



A] Length: 651 bp - 219 aa (full length gene) 

B] Putative ATG start codon is preceded by a 
35 typical Shine-Dalgarno sequence. Possesses a 

potential leader peptide sequence. 



40 ID- 146 



Clone 3-c88 
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Clone 2-c80 



ATGTTTTTAAGTATAATGGCAGGTGTCATAGCATTTGTCCTGACAGTTATT 
5 GCCATTCCACGCTTCATTAAGTTTTACCAATTGAAGAAAA TTGGCGGGCAA 
CAAATGCATGAAGATGTCAAACAACATCTAGCCAAAGCAGGTACGCCGAC 
AATGGGAGGAACGGTATTTT 

MFLS1MAGV1AFVLTVIA1PRFIKFYQLKKIGGQQMHEDVKLQHLAKAGTPTMG 
10 GTVF 



Sequence description: 

15 

A] Length: 172 bp - 57 aa (partial sequence) 

B] Shine Dalgarno sequence precedes 'ATG' start 
codon. Possesses a potential leader peptide 
sequence. 

20 



ID- 144 

25 

Clone 3-83 



ATGAAACCATATTTATCTTTTATTGGTAGAACGTTATTATACTTCGGTATTT 
30 TATTGTTACTAATTTACTTTTTTGCATACCTTGGTCGCGGACAAGGCAGTTT 
TATTTATAA 

MKPYLSFIGRTLLYFGILLLL1YFFAYLGRGQGSFIY 

35 

Sequence description: 



A] Length: 113 bp - 37 aa (partial sequence) 
40 B] Putative ATG start codon is preceded by a 

typical Shine-Dalgarno sequence. Possesses a 
potential leader peptide sequence. 
This orf is not in frame with nuc 
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ATGCCACTTACAGCACTTGAAATTAAAGATAAAACATTTTCATCAAAATTT 
CGCGGTTATAGCGAAGAAGAAGTT 

MPLTALEIICDKTFSSKFRGYSEEEV 



Sequence description: 



A] Length: 75 bp - 25 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide 



ID- 147 



20 Clone 3-90 



ATGTCACTTTTTCAAGAAAAAATTGCTTACAATTGCGCTAAAAAGGAAGCG 
CTTTATAAAGAGAGTTTAGGACGCTACGCCTTGAGATCAATGCTAGCAGG 

25 GGCTTATTTGACAATGAGTACTGCTGCCGGTATCGTCGCAGCTGATACTAT 
TGGTAAAATTTCTCCTGCTCTATCAGGTTTTGTATTTGCTTTCATCTTTAGTT 
TTGGACTTATTTATGTTTTAATATTTAATGGTGAATTGGCGACATCTAATAT 
GCTTTATCTCACTGCAGGAGCCTATAATAAAAATATCTCTTGGAAAAAAGC 
CATAACAATTTTAATTTATTGTACTTTTTTCAACCTCGTTGGTGCTTGTATA 

30 TT AG CTTGGTTGTTT AA 

MSLFQEKIAYNCAKKEALYKESLGRYALRSMLAGAYLTMSTAAG1VAADT1G 
KISPALSGFVFAFIFSFGLIYVLIFNGELATSNMLYLTAGAYNKNISWKKAITILI 
YCTFFNLVGACILAWLF 

35 

Sequence description 



40 A] Length: 406 bp - 125 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possible 
leader peptide 
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ID- 148 



Clone 3-92 



AAGTTACAAGCGACTGAAGTTAAGAGCGTTCCGGTAGCACAACCAGCTTC 
1 0 AAC AAC A AATGC AGT AGCTGC AC ATCCTG A AA ATGC AGGGCTCC AACCTC 
ATGTTGCAGCTTATAAAGAAAAAGTAGCGTCAACTTATGGAGTTAATGAA 
TTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGGTAAAGGTTTAGC 
AGTTGACTTTATTGTAGGTAAAAACCAAGCACTTGGTAATGAAGTTGCACA 
GTACTCTACACAAAATATGGCAGCAAATAACATTTCATATGTTATCTGGCA 
1 5 ACAAAAGTTTTATTCAAATACAAATAGTATTTATGGACCTGCTAATACTTG 
GAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTATGACCACGT 
TCACGTATCATTTAA 

KLQATEVKSVPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVNEF 
20 STYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQ 
KFYSNTNSIYGPANTWNAMPDRGGVTANHYDHVHVSF 



Sequence description 

25 

A] Length: 419 bp - 139 aa (partial sequence) 

B] N- and C-termini have yet to be determined 

30 



ID- 149 



35 Clone 3-94 



ATGATTCCAGTAGTTATTGAACAAACAAGTCGTGGTGAACGTTCTTATGAT 
ATTTACTCACGTCTTTTAAAAGATCGTATTATTATGTTGACAGGCCAAGTT 
40 GAGGATAATATGGCCAATAGTATCATTGCACAGTTATTGTTTCTCGATGCA 
CAAGATAATACAAAGGATATTTACCTTTATGTCAATACACCAGGTGGTTCA 
GTATCGGCTGGACTTGCTATTGTGGACACCATGAACTTCATTAAATCGGAC 
GTACAGACGATTGTTATGGGGATGGCTGCTTCGATGGGAACCATTATTGCT 
TCAAGTGGTGCTAAAGGAAAACGTTTTATGTTACCGAATGCAGAATATATG 



5y/y o 
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ATCCACCAACCAATGGGCGGAACAGGCGGAGGTACACAGCAATCTGATAT 

GGCTATCGCTGCTGAGCATCTTTTAAAAACGCGTCATACTTTAGAAAAAAT 

CTTAGCTGATAATTCTGGTCAATCTATTGAAAAAGTCCATGATGATGCAGA 

GCGTGATCGTTGGATGAGTGCTCAAGAACACTTGATTATGGCTTTATTGAT 

GCTATTATGGAAAATAATAATTTACAATAATAGATTTAAAAGAGTTGAGTT 

TACCAACTCTTTTTTTATTTGTTGGAATTATGTTATAATCTTAGTAATTACA 

GATATGACGCAGAAAGGAAAAAATTATTGA 

M1PVVIEQTSRGERSYDIYSRLLKDRIIMLTGQVEDNMANSIIAQLLFLDAQDN 

TKLDIYLYVNTPGGSVSAGLAIVDTMNF1KSDVQTIVMGMAASMGTIIASSGAK 

GKRFMLPNAEYMIHQPMGGTGGGTQQSDMAIAAEHLLKTRHTLEKILADNSG 

QSIEKVHDDAERDRWMSAQEHLIMALLMLLWKIIIYNNRFKRVEFTNSFFICW 

NYVIILVITDMTQKGKNY* 



Sequence description 

A] Length: 693 bp - 231 aa (full length gene) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide. Significantly, it would appear to have a 
very hydrophobic C-terminus. 



ID- 150 



Clone 2-c86 



ATGAAACCAAAAaTTATTGGTGTACTTGGTCTAGGAATATTTGGACAAACA 
CTCGCACAAGAACTAAGTAACTTTGAACAAGATGTTATTGCTATTGACAGC 
AATCCTGAAAATGTACAAGCTGTCGCCGAAGT 

TGTTACAAAAGCAGCTATCGGAGACATTACTGATTTAGCTTTCCTAAAACA 

CATCGGGATCAGTGACTGTGATACTGTTATTATTGCTACAGGAAACAGTTT 

AGAGAGCTCAGTATTGGCCGTAATGCACTGTAAAAAGTTAGGCGTCCCAC 

AAGTTATTGCTAAAGCTCGAAACCTTGTATACGAAGAAGTACTTTATGAAA 

TTGGTGCTGATTTGGTTATCTCTCCGGAGCGAGAATCTGGGCAAAATGTTG 

CTGCAAACCTCATGAGAAATAAAATTACAGATGTCTTCCAGATTGAATCTG 

ATATTTCTGTCATTGAATTT 

MKPKIIGVLGLGIFGQTLAQELSNFEQDVIAIDSNPENVQAVAEVVTKAAIGDI 
TDLAFLKHIGISDCDTV11ATGNSLE 



«rt»H n w V HtuP k it>' ,-«?., qmp *!t,ip .t» 
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SSVLAVMHCKKLGVPQVIAKARNLVYEEVLYEIGADLVISPERESGQNVAAN 
LMRNKITDVFQIESDISVIEF 



Sequence description: 

A] Length: 459 bp - 153 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possesses a 
potential leader peptide sequence. 

This orf is not in frame with nuc 



ID-151 



Clone 2-c88 



GTGCGTTATAGTAAAGAGATTATTCAGTTAGCTATACCAGCTATGATTGAA 

AATATCTTACAAATGCTCATGGGAGTAGTTGATAATTATCTAGTGGCTCAG 

TTAGGTGTTGTAGCAGTATCAGGTGTTTCAGTTGCTAATAATATAATTACT 

ATTTATCAAGCTATTTTTATAGCTTTAGGGGCGAGTATAGCAAGTCTATTG 

GCCAAGTCGTTAGCAGGTAGTGAGAAGGATGATGCAATTTCAGTATGTTCT 

CAAGCCATTTTTCTAACATCACTGATAGGGGCAGTATTAGGAATTATCTCG 

ATTGTTTTTGGACAAACTTTCTTT 

MRYSICEIIQLA1PAM1EN1LQMLMGVVDNYLVAQLGVVAVSGVSVANNI1TIY 
QAIFIALGASIASLLAKSLAGSEKDDAISVCSQA1FLTSLIGAVLGIISIVFGQTFF 



Sequence description 



A] Length: 330 bp - 1 10 aa (partial sequence) 

B] Putative GTG start codon is preceded by a 
typical Shine-Dalgarno sequence. May have a 
leader peptide 



ID- 152 
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Clone 2-c92 

5 

TTGATTAACAAGTATTCGTGCTTTTTGAAGAGGATTCTCCATAATAATACT 
CCTTTAATAGTTATCGTGAGAAGTATTTTAAAGAAAAACCGCCAAGGTAG 
AGCGACATTTCTGCCTTTAACTACAATAAAACCAAGAGAATTAGCACAAC 
ATTATCTCTCAAAATTACAAAGTTCTCAAGGGTTTTTAGGAATAGCTAGTG 
1 0 AATTGGTAACCTATGATCAACGCTTGTCAA AC ATTTTT 

MINKYSCFLKRILHNNTPLIVIVRSILKKNRQGRATFLPLTTIKPRELAQHYLSK 
LQSSQGFLGIASELVTYDQRLSNIF 

15 Sequence description 



A] Length: 240 bp - 80 aa (partial sequence) 

B] No obvious Shine Dalgarno sequence precedes the Putative TTG start 
20 codon 



ID- 153 

25 



Clone 2-c94 



3 0 TTGTTG ACTC AC AA A A AT AT ATT ATT AACC ATT AT ATTTGG ATT ATTT ATG A 
TTATATTATCAGCATGTGGTATGTCTAATAAGGAAATGGCTGGTATTGATA 
ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAAT 
ACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCTTTG 
ATATTGATTTAGCTAATGCTGTTTTTAAAGAATACGGTATTTCAGTGAAAT 

3 5 GGC AGCCTATTA ACTGGG AT ATG A A AG A AACTG AACTT AAT A ATGGT A AT 
ATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTAAAAA 
AGTCGCTTTTACAAACCCATATATGAATAATCATCAAGTAATTGTTACTAA 
AACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAG 
GAGCCCAGTCGGGTTCATCTGGTTTTGATGCTTTTAACGCTAAACCTGATA 

40 TTTTAAAAAAGTTTGTAAAAGGAAAAGAAGCAGTTCAATACGATACTTTC 
ACTCAGGCTTTGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATT 
GATGAAGTTTATGCTAACTATTATTTAAAGCAAGAAGGAA 



m'tii n-^v «v «in M *t««t> n».iV ki* 
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MLTHKN1LLTIIFGLFMIILSACGMSNKEMAGIDNWEHYQKEKKITIGFDNTFV 
PMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKETELNNGNIDLI 
WNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSH1NSIKDMKGKKLGAQSG 
S SGF D AFN AKPDILKKF V KGKE A VQ YDTFTQ A LI DLKNNR LD GLLIDE V Y ANY 
YLKQEG 



Sequence description 

A] Length: 649 bp - 216 aa (partial sequence) 

B] TTG start codon is preceded by a possible 
typical Shine-Dalgamo sequence. Has a 
leader peptide 



ID- 154 



Clone 2-clOO 



ATGAAAATTTGGAAAAAAATAACCTTAATGTTTTCTGCAATTATTTTAACA 

ACAGTAATTGCATTGGGAGTCTATGTTGCCTCAGCTTATAATTTTTCGACTA 

ATGAATTGTCTAAGACTTTT 

MKIWKKITLMFSAIILTTVIALGVYVASAYNFSTNELSKTF 



Sequence description 



A] Length: 123 bp - 41 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Has a 
typical leader peptide 



ID-155 



Clone 2-cl 
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ATGAAAAAACAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 

ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 

GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTTAAATATATATCAG 

ATAAAAATAACTATTTAGATAATATAAAAGTTTATTACTTTTCTATAAGTA 

TTTCTAAAGATGTACAAGATAAAGTCAGTGAAACAACAACTTGTTCATATA 

GACTAGAAAAGCAAAAGAATCAAGAGTTCATTGGTAATTTTGAACATGAA 

GTTAGTGAATCTAGTCAATATTCAACCGAAGTTAAAAATCAAATACAGTAT 

CCAATCCAGTATAAAGATAATTCAATTCGTTTTACTGAAAAAACACCGTCA 

GAACGTTATGATGAGTTTGTTTTTAGTTCATTTGATTCTTCATTATTAAAAA 

AATATAAAATATATGATTACTTACTAAAACATCCCGAAACTGAATTAAAA 

GGTGTTTCCTATAAGATTCCTATAAATTCTGAAATTGTAGCCCCTTTTATAA 

ATCAATTAAATATAAAAAATCCTAAAAAATCATCTATTTCGGTTACAAAAA 

CGGAAAGTAAAGAATATTATTATACAATCAGTATTGATACTGATTCTGAGA 

TATATTCTATATTCGAAGGTATTCAT 

MKKQRLLLLFGGLLIMIMMTACKDSKIPENRTKKEYQAEQNFKSYFKYISDKN 

NYLDNIKVYYFSISISKDVQDKVSETTTCSYRLEKQKNQEFIGNFEHEVSESSQ 

YSTEVKNQIQYPIQYKDNSIRFTEKTPSERYDEFVFSSFDSSLLKKYKIYDYLLK 

HPETELKGVSYKIP1NSEIVAPFINQLNIKNPKKSSISVTKTESKEYYYTISIDTDS 

EIYSIFEGIH 



Sequence description 



A] Length: 687 bp - 229 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Has a 
typical leader peptide. C-terminus has yet to be 
verified 



ID- 156 



Clone 2-c5 



ATGACATTTGACACCATTGATCAATTAGCGGTTAATACAGTCCGCACGCTT 
TCTATTGATGCTATCCAAGCAGCAAATTCTGGGCACCCAGGTCTTCCTATG 
GGAGCTGCGCCTATGGCTTATGTGCTTTGGAATAAATTCTTAAATGTAAAC 
CCAAAAACAAGTCGCAATTGGACAAACCGTGACCGTTTTGTACTTTCAGCT 
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GGGCATGGTTCAGCTCTTCTTTATAGCCTACTTCATTTAGCTGGCTATGATT 
TATCAATTGATGATTT 

MTFDTIDQLAVNTVRTLSIDAIQAANSGHPGLPMGAAPMAYVLWNKFLNVNP 
5 KTSRNWTNRDRFVLSAGHGSALLYSLLHLAGYDLSTDD 



Sequence description 

10 

A] Length: 272 bp - 90 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. No obvious 
leader peptide 



ID- 157 

20 

Clone 2-c8 



ATGAGAACACTATTTAGAATGATATTTGCTATTCCAAAGTTTATCTTTAGA 
25 TTGATTTGGAATATCATTTGGGGAATATTCAAGACAGTTCTTGTTATTGCG 
ATTATTTTATTTGGCTTGTATTACTATGCGAATCACAGTCAATCAGAATTTG 
CTAATCAACTTAGTGACATTATTCAGACAGGAAAAACATTTTT 

MRTLFRMIFArPKFIFRLIWNIIWGIFKTVLVIAIILFGLYYYANHSQSEFANQLS 
30 DIIQTGKTF 



Sequence description 

35 

A] Length: 197 bp - 65 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgamo sequence. Possesses a 
leader peptide 



ID-158 
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Clone 2-c9 



ATGTCAAAAAAAATAATATTAGGAATTTTATCTCTTTTATCTGTCGTTACTT 
TGGTGGCGTGTGGTTCATCAGACAAACAGCTACAAGATAAAGTTGAGAAA 
AAAGGGAAGTTAGTTTTAGCGGTGAGTCCAGATTATGCTCCCTTTGAGTTT 

MSK.KIILG1LSLLSVVTLVACGSSDKQLQDKVEFCKGKLVLAVSPDYAPFEF 
Sequence description 

A] Length: 153 bp - 51 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide (not in frame with nuc) 



ID-159 



Clone 2-clO 



ATGAAAAATCAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 

ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 

GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTT 

MKNQRLLLLFGGLLIMIMMTACKDSKIPENRTKKEYQAEQNFKSYF 



Sequence description 



A] Length: 139 bp - 46 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 
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ID- 160 



Clone 2-cl 1 

5 

ATGATTGGAAAATTATATTATAGCTATAGAAAGTCACGCTTATTAAGAAGT 
ATTTTATGGCTTATTTTAATTGTTGGTGTATATATGTTAGGACAACGTGTTT 
TATTATCCACTGTTCCTTTATCACATCAAGAGATAAAACTAGCAGTAGATC 

1 0 AACATTTACTC A AT AACTTTTCAGC AGTAAGTGGTGGG AGTTTTAATAAAT 
TAAATGTTTTCACACTGGGGTTGAGTCCATGGATGTCAAGTATGATTATTT 
GGAGATTCGTTTCCTTATTTTCGTGGGCAAAAAATGCAACGAAGCGAAAA 
GCAGAAGTAGCTCAATATACTTTAATGCTTACTATCTCAGTTATACAAGCA 
TATGGTGTTTCAGGAAATCAATTTATAAAAAGCTCTTTATTAGGTTCTTATA 

1 5 GTGATATTGTTTTT 

MIGKLYYSYRKSRLLRSILWLILIVGVYMLGQRVLLSTVPLSHQEIKLAVDQHL 

LNNFSAVSGGSFNXLNVFTLGLSPWMSSMIIWRFVSLFSWAKNATKRKAEVA 

QYTLMLTISVIQAYGVSGNQFIKSSLLGSYSDIVF 

20 

Sequence description 



25 A] Length: 423 bp - 141 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 



ID-161 



35 Clone 2-cl3 



ATGAAAGGTCTATTGGATTTTTTAGTTAATATTGCCAGAACGCCAGCTATT 
TTAGTCGCCTTGATAGCCATTATCGGTTTAGTACTGCAGAAAAAAGGTGTT 
40 CCTGATATTGTAAAAGGTGGAATAAAAACATTTGTTGGCTTCTTAGTGGTT 
TCTGAAGGTGCAGGGATAGTCCAAAATTCCTTGAATCCATTTGGAAAAATG 
TTTGAACATGCTTTTCATTTGGTGGGGGTAGTTCCTAATAATGAAGCCATT 
GTAGCAGTAGCTCTTACGAAGTATGGCTCAGCAACTGCTTTGATTATGTTA 
GCGGGAATGATTTTTAATATTTTAATTGCTCGTTTTACAAAA 



, . u.n.i •!».(• n™5 ">i> : >«ii.. i«iP "«-P 
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MKGLLDFLVNIARTPAILVALIAIIGLVLQKKGVPD1VKGGIKTFVGFLVVSEG 
AGIVQNSLNPFGKMFEHAFHLVGVVPNNEAIVAVALTKYGSATALIMLAGM1 
FNILIARFTK 



Sequence description 



10 A] Length: 348 bp - 1 16 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
Shine-Dalgarno sequence. Possible leader 
peptide 

15 

ID- 162 



Clone 2-c21 

20 

TTGGTTGGTAAGCCCCAATTACTATTTTTAGATGAACCTACTTCCGGAATG 
GATACTTCCACACGTCAACGATTTTGGAAGCTGGTTGCGACACTAAAAAA 
AGAAGGTGACACAATTGTCTATTCTAGTCATTATATCGAAGAGGTAGAAC 

25 ATACAGCTGATAGGATTTTAGTACTTCATAAAGGAAAGTTATTACGCGATA 
CAACCCCCTTTGCCATGAAGCAAGAAAAAACCGAAAAGTTATTCACCGTT 
CCGCTTAGTTATCAAAAATTATTACCTACCTATTTGATTACAGAGTGTGAA 
GCCAAGAGTGATAGTATAACGTTTGTTACTGGGGAGGCTGAAACTGTATG 
GAAAATACTGGCAGATAATGGTTGTCCTATTGAAGCTATTGAGATGACCA 

3 0 AT AG A ACTTTGTT AAATCGT ATTTTTG AG ACT ACT A AG G AGGT AA A AC ATG 
AGAATCTTTA 

MVGKPQLLFLDEPTSGMDTSTRQRFWKLVATLKKEGDTIVYSSHYIEEVEHTA 
DRILVLHKGKLLRDTTPFAMKQEKTEKLFTVPLSYQKLLPTYLITECEAKSDSI 
35 TFVTGEAETVWKILADNGCPIEAIEMTNRTLLNRJFETTKEVKHENL 



Sequence description 

40 

A] Length: 462 bp - 155 aa (partial sequence) 

B] B] Putative TTG start codon is not preceded by 
an obvious Shine-Dalgarno sequence. No obvious 
leader peptide. N- and C- termini require further 



5 



25 



30 



35 
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examination. 

ID- 163 
Clone 2-c25 



TTGAAAAAATCCAAGAGAAGCCGTAAGGCAGTGACAACAAGTGGTGAGA 
1 0 AG ACTTTACTTG AGG ATTTGGC AA AAATGAATTTCCTAG ACG AAGTC ATT A 
ATGTTATGGTTTTATATACCTTGAATAAGACAAAATCTGCTAACTTAAATA 
AGGCCTATATCATGAAAGTTGCTAATGATTTTGCCTTTCAGAATGTTATGA 
CGGCCGAAGATGCTGTGCTTAAAATTCGTGATTTTTCAGATCAAAAAGTAA 
GGACTAAAACAGAAACGAAGAAGAAACAATCGAATGTTCCTGAATGGAGT 
1 5 AATCCTGATTATAAAGATGAGGTTAGCCC AG A AAAAGAA ATTGAATTAGA 
ACAGTTT 

MKKSKRSRKAVTTSGEKTLLEDLAKMNFLDEVINVMVLYTLNKTKSANLNK 
AYIMKVANDFAFQNVMTAEDAVLKIRDFSDQKVRTKTETKKKQSNVPEWSN 
20 PDYKDEVSPEKEIELEQF 



Sequence description 



A] Length:360 bp - 120 aa (partial sequence) 

B] N- and C- termini require verification. 



ID- 164 



Clone 2-c28 



ATGACGAATCATATTACTAAACTGATAGAAAATAGCGGAAAAAAATTGAC 
AGAAATTAGCGAAGCTACAGATATAGCCTATCCTACACTTTCTGGATACAA 
TCAAGGAATCCGCAAACCTAAAAAAGATAATGCTGAAAAATTGGCAAAAT 
40 ACTTTAATGTTTCCGTCGCTTACATTATGGGACTTGATAGCAACCCACATG 
CTCCATCAAATCTT 



MTNHITKLIENSGKKLTEISEATDIAYPTLSGYNQGIPvKPKKDNAEKLAKYFNV 
SVAYIMGLDSNPHAPSNL 



gQ^O/g ><5.i 3»iii 'i...!' . .fi,. i ; if" tit toTi 1 »,.f i'^p n,.? m.i 



Sequence description 

A] Length:218 bp - 72 aa (partial sequence) 

B] ATG start codon is preceded by an 
obvious Shine Dalgarno sequence. No obvious 
leader peptide. 



ID- 165 

15 



Clone 2-c29 



TTGATGAAAAGGAATAAACATTTACCGTTAACAGAAACTACCTATTATATT 
20 TTATTAGCTTTGTTTGAGGAAGCGCATGGCTATGCTATTATGAAAAAAGTT 
GAAGAAATGAGTGGCGGTGATGTTAGAATAGCCGCAGGGACAATGTACGG 
TGCCATTGAAAATTTACTTAAACAAAAATGGATAAAGTCTATCTCAAGTGA 
CGATAGAAGAAGAAAAGTTTATATTATTACTGAGACAGGAAAAGAAATAG 
TAGAACTTGAAACGAATCGATTAAGAAAGTTACTTAATACTGCTAATCAGT 
25 TG GGTTTTG G AG G AG ATGGTT ATG AT AA AGTTT 

MMKJWKHLPLTETTYYILLALFEEAHGYAIMKKVEEMSGGDVRIAAGTMYG 
AIENLLKQKWIKSISSDDRRRKVYI1TETGKEIVELETNRLRKLLNTANQLGFG 
GDGYDKV 

30 

Sequence description 



35 A] Length:337 bp - 1 12 aa (partial sequence) 

B] TTG start codon is preceded by an 
obvious Shine Dalgarno sequence. Actual start 
codon may ATG that comes immediately after the 
TTG. Potential leader peptide. 

40 



ID- 166 
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Clone 2-c35 



5 CCCATTACTGGTGAGTTAATAGCTGAGAAATTAGGAGTACCAAGAGCAGC 
ACTAAGGTCTGATTTGCGGGTTTTAAGTATGCTAGGTATCATAGATGCAAA 
ACCTAAGGTTGGTTATTTTTATTTAGGACAGTATCATGCTTCAATAGGGAC 
AAGTCATTTTGAAAAGATGACAGTTTCAGAAATTATGGGGATCCTTCTGAC 
AGTTCATCAAAAAGATTCAGTTTATGATGTTATTGTACATATTTTTATGGA 

1 0 AGATGCTGGTTGTGCTTTTATCTTGG ATGATGATG ATTTTCTCTGTGGAGTC 
GTGTCACGTAAAGATTTACTAAAAACCAGTATTGGCGGAGGAGATCTTTCT 
AAAATGCCAATAGGAATGGTGATGACACGTATGCCACACGTGACAACTGT 
TTTAGAAAATGAAAGTCTTTTTGCGGCAGCTGATAAATTAGTGAGCAGAA 
AAGTGGATAGTCTCCCTGTCGTTCGTCATGATAAGCAATATCCCGAAAAAT 

15 TTA 

PITGELIAEKLGVPRAALRSDLRVLSMLGIIDAKPKVGYFYLGQYHASIGTSHF 
EKMTVSEIMGILLTVHQKDSVYDVIVHIFMEDAGCAFILDDDDFLCGVVSRKD 
LLKTSIGGGDLSKMPIGMVMTRMPHVTTVLENESLFAAADKLVSRKVDSLPV 
20 VRHDKQYPEKF 



Sequence description 

25 

A] Length: 51 1 bp - 170 aa (partial sequence) 

B] N- and C-termini to be determined 



30 

ID- 167 



Clone 2-44 

35 

TTGGAAGTCATCATGCAATTTATTTATAGTATTATTGGTATTTTATTGGTAT 
TAGGAATTGTGTATGCAATTTCTTTCAATCGTAAGAGTGTTTCTCTAAGTTT 
AATTGGAAAAGCTCTTATCGTTCAATTCATTATTGCGCTAATCTTAGTACGT 
40 ATCCCACTAGGCCAACAAGTTGTTAGTGTTGTTTCAACTGGAGTTACTAAA 
GTAATCAACTGTGGTCAAGCTGGTTT 



MEV1MQFIYS1IG1LLVLGIVYAISFNRKSVSLSL1GKALIVQF11ALILVRIPLGQQ 
VVSVVSTGVTKVINCGQAG 
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Sequence description 

5 

A] Length:233 bp - 77 aa (partial sequence) 

B] TTG start codon is preceded by a 
possible Shine Dalgarno sequence. Actual start 
codon may occur further downstream. Potential 

10 leader peptide. 

ID- 168 



Clone 2-46 

15 

CAACCTAATAAAGCTTTAGAAAGTGATGAGATTGATATTAATGCTTTCCAG 
CATTATAATTACTTAACCAATTGGAATAAAGCAAATAAGACCAATCTTGTT 
TCCGTTGCTGAGACATACTTTACTTCCTTTAGATTATACTCTGGTACTAAGA 
20 ACGGTAAAGGTAAATACCAAACAGTTTCTGAAATTCCAAATAAAGCAACT 
ATTACTATCCCAAACGATGCAGTTAACGAAAGTCGCTCTCTCTACTTGTTA 
CAATCAGCAGGCTTGCTAAAATTGAAAGTATCAGGTGATACATTAGCAAC 
AATGTCAGATGTTGTTTCCAATCCTAAATCTTTAGATTT 

25 QPNKALESDErDINAFQHYNYLTNWNKANKTNLVSVAETYFTSFRLYSGTKN 
GKGKYQTVSEIPNKATITrPNDAVNESRSLYLLQSAGLLKLKVSGDTLATMSD 
VVSNPKSLD 



30 Sequence description 



A] Length:344 bp - 1 14 aa (partial sequence) 

B] N- and C- termini require verification 

35 



ID- 169 



40 Clone 2-47 



ATGAAATGTATAATAAATAATATAAATAAAATAAAAATGATAATTGAGAT 
TTATCATAGAAGGAAAACTATTTTGAAATTAAATAAAATCATATTATCTAC 



72/98 



TGCAGCTCTTACTGCTCTCTTTTTAGGATATAATAGCGTTACTGCGGATACA 
TATAATAACTATCAGCCACATAGATCAAATAATATGGATTTAACTGAGGA 
ATATAACTATAATAACCAGATAGAACTTCAGGAGCGTATAAAAAACCTAA 
ATATACCTTTT 



MKCI [NNINKIKMIIEIYH RRKT1LK.LN KJ1LST AALT ALFLG YNS VT ADT YNNY 
QPHRSNNMDLTEEYNYNNQIELQERIKNLNIPF 



20 ID- 169 



Clone 2-47 

25 

ATGAAATGTATAATAAATAATATAAATAAAATAAAAATGATAATTGAGAT 
TTATCATAGAAGGAAAACTATTTTGAAATTAAATAAAATCATATTATCTAC 
TGCAGCTCTTACTGCTCTCTTTTTAGGATATAATAGCGTTACTGCGGATACA 
TATAATAACTATCAGCCACATAGATCAAATAATATGGATTTAACTGAGGA 
30 ATATAACTATAATAACCAGATAGAACTTCAGGAGCGTATAAAAAACCTAA 
ATATACCTTTT 

MKCIINNINKTKMIIEIYHRRKTILKLNTCIILSTAALTALFLGYNSVTADTYNNY 
QPHRSNNMDLTEEYNYNNQIELQERIKNLNIPF 

35 



5 



10 



Sequence description 



15 



A] Length:264 bp - 88 aa (partial sequence) 

B] There is a Shine-Dalgarno sequence upstream 
of this sequence. Potential leader peptide 
sequence 



Sequence description 



40 



A] Length:264 bp - 88 aa (partial sequence) 

B] There is a Shine-Dalgarno sequence upstream 
of this sequence. Potential leader peptide 
sequence 



V3/98 *"* : '" ! '' " ^ " ™ "** 



ID- 170 



5 Clone RS-58b 



TTGGGTGATTATTATGGTAAGAAATATTTTGGTGAGGCAGCTAAAAAAGA 
CGTCGAACATATGGCTAAGAAAATCATTAATGTCTATAAAACACGGTTAA 

10 AAAACAACACTTGGTTATC 

AGAAAATACAAAAGCAATGGCCATTAAGAAACTTGATAACATGAGATTAA 
TGATTGGCTATCCAGAAGATTATCCTGATCTTTATCGTCAGTACCAATTTG 
ATAGTAAAGCAAGCTTCTTTGAAAACAATGATAACTACAGAAAATTATCG 
AACAAGAAAACATTTGAAGAATTTAACCAGTCTAATCAACGTGAACATTG 

1 5 GCAAATG AGTGCC AATGCTGTAAATGCTTATAATGATCCTAATACCAATTC 
CATAGTCTTTCCAGCAGCGATTTTTCAATCACCACTGTACGATAAAACTAA 
AACAGTTAGTCAAAATTATGGAGCTATCGGAGCAATTATTGGTCATGAAAT 
TTCACACTCATTTGATATTAATGGTATGAAATATGACGAGAAAGGGAATCT 
TCACGATTGGTGGACTAAAGAAGATTTAAATCATTATAAGAAATCAACAC 

20 AAGCTATGATTGACCAATGGGATGGCCTTAAAGCAGATGGCGGTAAAGTT 
GATGGTAAATTAACTTTAGCAGAAAATATTGCAGATAATGGTGGTGTTATG 
GCATCTCTAGAAGCTCTTAAGACTGAAAAAATCCAAACTATAAAGAATTTT 
TTGAATCATGGGCAAGTATTTGGCGTCAAAAAGCAACCAAAGAACAAAGT 
AAGTCCTCAATTCAGTCAGATGTTCATGCACCATATGAATTGA > 

25 GAGCTAACATCCCAGTACGTAATTTCCAAGAATTTTATGATGCCTTTGGTG 
TTAAAAAAGGCGATTCAATGTATCTAAAACCAGAAAAACGTTTGACACTTT 
GGTAA 



30 MGDYYGKKYFGEAAKKDVEHMAKKIINVYKTRLKNNTWLSENTKAMAIKK 
LDNMRLMIGYPDYPDLYRQYQFDSKASFFENNDNYRKLSNKKTFEEFNQSNQ 
REHWQMSANAVNAYNDPNTNSIVFPAAIFQSPLYDKTKTVSQNYGAIGA1IGH 
EISHSFDINGMKYDEKGNLHDWWTKEDLNHYKKSTQAMIDQWDGLKADGG 
KVDGKLTLAENIADNGGVMASLEALKTEKIQTIKNFLNHGQVFGVKKQPKNK 

35 VSPQFSQMFMHHMN* 

Sequence description: 

A] Length: 819 bp - 272 aa (full length gene) 
40 (107 bp of additional DNA sequence (> onwards) is 

also included. While not in-frame with the 
described orf, it also shares strong homology 
with the neutral peptidases. 



y^y^Jg uifin ">„;'' 'i.,-t> Hh'? Mu» n „„.r «.,„'" Tii 'in.? iw, 



B] This gene sequence was not identified using the LEEP system. It was identified 
downstream of the ID-89 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-89 gene sequence. ID-89 and 1D-170 together 
show homology over their combined entire length with the neutral endopeptidases 
from Lactococcus and Lactobacillus. Possesses TTG (possible ATG start codon 
located 13 bp further downstream) start codon with no obvious signal peptide. Shine 
Dalgarno sequence not immediately obvious. Possibly located further downstream 



ID-171 



Clone 2-1 8/22b (Mod2) 



ATGACCATGATTACGCCAAGCTTCATTAAGGTATCTCTAGATGAAACAAAT 

CGTATGATGCGTATGATATCAGATTTATTAAGTTTATCGCGCATTGATAAT 

GAAGTAACGCATTTAGATGTTGAAATGACGAATTTTACAGCTTTCATGACC 

TCAATTTTGAATCGATTTGATCAGATTAGAAATCAAAAAACAGTCACAGG 

AAAAGTTTATGAAATTGTCAGAGATTATCCTCTTAAGTCAATTTGGGTGGA 

AATTGATACAGATAAGATGACTCAAGTGATTGATAACATTTTAAATAATGC 

AGTCAAGTATTCACCAGATGGTGGTAAGATTACAGTTAATCTACGCACAAC 

TAAAACGCAGATGATTTTATCAATATCAGACCAAGGCTTAGGTATTCCCAA 

AAAAGATTTACCTCTCATTTTTGATCGTTTTTATCGTGTTGATAAGGCGAGA 

AGTCGTCAACAGGGTGGGACTGGACTTGGTTTGTCAATTGCAAAAGAAAT 

TGTTAAGCAGCATAAGGGATTTATTTGGGCTAAGAGTGAGTATGGTAAAG 

GGTCTACTTTTACAATCGTCTTGCCTTATGATAAAGATGCTGTAACTTATGA 

AGAATGGGAGGACGTTGAAGATTAA 



MTMITPSFIKVSLDETNRMMRMISDLLSLSRIDNEVTHLDVEMTNFTAFMTSIL 
NRFDQIRNQKTVTGKVYEIVRDYPLKSIWVEIDTDKMTQVIDNILNNAVKYSP 
DGGKITVNLRTTKTQMILSISDQGLGIPKKDLPLIFDRFYRVDKARSRQQGGTG 
LGLSIAKE1VKQHKGF1WAKSEYGKGSTFTIVLPYDKDAVTYEEWEDVED* 



Sequence description: 

A] Length: 613 bp - 212 aa (full-length gene possibly) 

B] Possible Shine Dalgarno sequence present 
upstream of a ATG start codon. May not have yet 
determined the N- portion of this gene. No 
obvious signal peptide. 



.= .r~ii .r-i. g-jt s^.- sr*?. =j^- ,;n> <.-~-ji 
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ID-172 

5 

Clone 2-54balternate (107b) 



TTGAAAAAAATTATTACTTCTATTCTATTACTTAGTTGCATTTTTTTTATGC 

1 0 C AACC ATCTCTGCTG AATCTTTTAATGCTTCCGCTAAACATGCCTTAGCAGT 
TGATTTAGATTCAGGAAAAATCTTGTATGAAAAAGATGCTAACAAACCCG 
CTGCTATTGCTTCCTTGACTAAAATAATGACCGTTTATATGGTCTATAAAG 
AAATTGATAACGGTAACCTCAAGTGGAATACCAAAGTAAATATATCTGAC 
TACCCTTATCAACTAACACGCGAATCTGATGCTAGTAATGTTCCTTTAGAA 

1 5 AAAAGGCGCTATACTGTTAAACAACTCGTGGACGCTGCCATGATTTCTAGT 
GCTAACAGTGCAGCCATTGCTTTAGCTGAACATATTTCAGGAACTGAAAGT 
AAATTTGTTGATAAAATGACTGCTCAATTGGAAAAGTGGGGAATTCATGAT 
AGCCACCTAGTCAATGCTTCTGGCTTAAATAATAGTATGTTAGGCAATCAC 
ATTTATCCAAAATCGTCACAAAACGACGAAAATAAAATGAGTGCACGTGA 

20 TATTGCTATTGCTGCCTACCATTTGGTCAACGAATATCCTTCCATTCTTAAG 
ATTACTAGTAAGTCCGTTGCTAAATTTGATAAAGATATTATGCATTCTTAT 
AACTACATGCTACCAGATATGCCTGTCTTTAGACCAGGTATTACAGGTTTG 
AAAACTGGGACAACGGAATTAGCTGGCCAATCTTTTATTGCTACATCTACT 
GAAAGTGGAATGAGACTACTCACTGTTATTATGCATGCTGATAAGGCCGAT 

25 AAAGACAAATATGCTCGCTTTACAGCAACTAACTCTCTCTTGAACTATATC 
ACAAACACCTACGAACCTAACCTTGTATTAGCTAAAGGAGCTGCATATAA 
AGGTAAAGAAGCAAGTGTGAGAGACGGAAAAGAACAATCGGTCATCGCT 
GTTGCTAAAAACGATTTGAAAGTAGTACAGAAGAAAAATATCACTAAACA 
AAATCAGTTAAAAATTAACTTTAAAAAAGAGCTTACTGCTCCTATTACAAA 

30 AAAAGAGAACCTAGGGAAAGCTTATTACGTTGACCTTAATAAGGTTGGAA 
AAGGCTATCTCATAAAGGAACCTAGCGTTCATTTAGTGGCAAAAGATAGT 
ATTGAGCGCAGTTTCTTCCTCAAAGTGTGGTGGAATCATTTTGTGCGCTAC 
GTTAACGAAAAACTTTAA 

35 

MKKIITSILLLSCIFFMPT1SAESFNASAKHALAVDLDSGKILYEKDANKPAAIA 
SLTKFMTVYMVYKE1DNGNLKWNTKVNISDYPYQLTRESDASNVPLEKRRYT 
VKQLVDAAMISSANSAAIALAEHISGTESKFVDKMTAQLEKWGIHDSHLVNA 
SGLNNSMLGNHIYPKSSQNDENKMSARDIAIAAYHLVNEYPSILKITSKSVAKF 
40 DKDIMHSYNYMLPDMPVFRPGITGLKTGTTELAGQSFIATSTESGMRLLTVIM 
HADKADKDKYARFTATNSLLNYITNTYEPNLVLAKGAAYKGKEASVRDGKE 
QSVIAVAKNDLKVVQKKNITKQNQLKINFKKELTAPITKKENLGKAYYVDLN 
KVGKGYLIKEPSVHLVAKLDSIERSFFLKVWWNHFVRYVNEKL* 



'-[' f*l! lT~:j iiPa "* tiTh S"!i "T r iT'S ""-l*:: 1 ^ 5*5 
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10 



Sequence description: 

A] Length: 1236 bp - 412 aa (full-length gene sequence possibly) 

B] A possible Shine-Dalgarno sequence precedes the putative 'TTG' start 
codon. (needs further cloning and sequencing to verify N-terminus) 



ID- 173 
Clone 3-60b 



ATGACGCTTCGAGAATTAACAATAGAAGAATTTAAAGAACATTCAGGAAA 
1 5 TTATGATTC AC AATC ATTTTTACAAACACCTGAGATGGCTAAACTTTTAGA 
AAAACGCGGCTATGATGTTAGGTATTTGGGATATCAAGTAGAAAATAAAC 
TAGAGATAATCAGTTTATCTTATATTATGCCAGTCACTGGTGGTTTTCAAAT 
GAAAATTGATTCAGGACCAGTTCATTCAAATTCTAAGTATCTAAAACAATT 
TTATAAAGCATTGCAAGGCTATGCCAAATCCAACGGTGTTCTAGAATTAAT 
20 AGTTGAGCCTTTTGATGATTACCAATTATTCACTAGTTCGGGAGTTCCTAGT 
AATCAGGGAAATGATAATCTGATTGAAGATTTTACCAGTTCAGGTTATCAC 
CATGATGGTTTAACAACTGGTTTTACTGGTAAATATTTATCTTGGCACTATG 
TTAAAAATTTAGAAGGTGTCACTTCTGAAACGTTACTATCTTCATTCTCTAA 
GACAGGACGAGCTTTGGTTAAGAAAGCAATGTCTTTTGGAATCAAGGTTC 
25 GCGTTCTTAAACGTGATGAGCTACATTTATTTAAAGAGATAACAACTTCTA 
CGTCAAATAGACGTGATTATATGGATAAGTCCTTAGATTATTATCAAGATT 
TTTACGATAGCTTTGAAGGCAAGGCTGAATTTGTGATTGCCACTTTAAATT 
TTAGAGAATACGACCATAACTTGCAAATAAAAGCTGAAGCATTGGAAAAT 
AAGCTT 

30 

MTLRELTIEEFKEHSGNYDSQSFLQTPEMAKLLEKRGYDVRYLGYQVENKLEI 
ISLSYIMPVTGGFQMKIDSGPVHSNSKYLKQFYKALQGYAKSNGVLELIVEPF 
DDYQLFTSSGVPSNQGNDNLIEDFTSSGYHHDGLTTGFTGKYLSWHYVKNLE 
35 GVTSETLLSSFSKTGRALVKKAMSFGIKVRVLKRDELHLFKEITTSTSNRRDY 
MDKSLDYYQDFYDSFEGKAEFV1ATLNFREYDFTNLQTKAEALENKL 



40 Sequence description 

A) Length: 771 bp - 257 aa (partial gene sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified immediately downstream of the ID-65 gene which was identified by 



.„fi.. "..,!>' n...r .„!i„ !*[«,? S, ti j: j" m S.„K „.,.,. „,1 *i. .? iw 
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LEEP, during cloning and sequence analysis of the full-length ID-65 gene 
sequence. Sequence Characteristics: 
No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgarno sequence. 



ID-174 

Clone 2-17b(ID-80b) 



TTGTCATTAAGTTTGGTTGCAGTGTTAAATCTTATCCCTCCTAAAATCATGG 

GATCAGTTATTGATGCTATTACAACTGGAAAATTAACAAGACCACAATTAC 

TATGGAATTTATTAGGTTTGGTTTTGTCAGCTTTAGCTATGTATGGGCTGCG 

TTATATTTGGCGTATGTATATTTTAGGGACTTCTTACAAATTAGGCCAAGTT 

GTCAGATACCGTTTATTTGAACATTTTACAAAAATGTCTCCTTCTTTTTATC 

AGAAATATCGTACAGGTGATTTAATGGCGCACGCGACCAACGACATCAAT 

TCTCTAACACGTCTTGCAGGAGGAGGAGTTATGTCAGCAGTGGATGCCTCT 

ATCACAGCATTAGTAACGCTTATCACCATGTTCTTTACTATTTCGTGGCAA 

ATGACATTAATTGCGGTTATCCCTTTGCCCTTAATGGCCTTAGCACTAGTA 

AATTGGGGCGAAAAACCCATGAAACCTTCAAAGAATCTCAGGCAGCCCTT 

TTCAGAATTAAATAATAAAGTG 

MSLSLVAVLNLIPPKIMGSVIDAITTGKLTRPQLLWNLLGLVLSALAMYGLRYI 
WRMYILGTSYKLGQVVRYRLFEHFTKMSPSFYQKYRTGDLMAHATNDINSLT 
RLAGGGVMSAVDASITALVTLITMFFTISWQMTLIAVIPLPLMALALVNWGEK 
PMKPSKNLRQPFSELNNKV 



Sequence description 

A) Length: 534 bp - 178 aa (partial gene 
sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-80 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-80 gene sequence. 
Sequence Characteristics: 

No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgarno sequence. 



.j^j jpj, T ;j; tjvi, ,p|j -w-Jj* £ -'V ,pj; ,f-^ jpfc vwrj. 
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ID-175 



5 Clone 2-1 lAb(ID-l 03b) 



ATGCATATTGAGACTGTTATTGATTTCAAAGAATTAGGAAAAAGATATCGT 
TTTAAAAATCCTACAAAAGAATTAATAGCTGATACTTTAGAACAAGTCTTA 

1 0 GAAGTG AT AAAAG AAGTTG ATT ATT ATC AATCTC AAA ATTATTATGTTGTT 
GGTTATTTATCTTATGAAGCATCTGCTGCTTTTGATTCACATTTTAAAGTTT 
CTCAACAGAAGTTGGCTGGAGAACATCTAGCTTATTTTACAGTACATAAAG 
ATTGTGAGAACGAAGCTTTTCCTTTAAGTTATGAAAATGTTAGATTAGCAG 
ATAATTGGACTGCTAATGTTTCTGAGCAAGAATATCAAGAGGCAATTGCTA 

1 5 ATATTAAAGGAC AAATTAGACAAGGAA ATACTTATCAAGTAAATTATAC A 
CTAGAGCTTAGCCAACAATTATGCTCGGATCC 



MHIETVIDFKELGKRYRFKNPTKELIADTLEQVLEVIKEVDYYQSQNYYVVGY 
20 LSYEASAAFDSHFKVSQQKLAGEHLAYFTVHKDCENEAFPLSYENVRLADNW 
TANVSEQEYQEAIANLKGQIRQGNTYQVNYTLELSQQLCSD 



Sequence description: 

25 

A] Length: 440 bp - 146 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-103 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-103 gene sequence. 

30 Shine Dalgarno sequence present upstream of 

ATG start codon, No apparent leader peptide sequence 

ID- 176 

35 

Clone 2-18/22b(b) (ID- 104b) 



40 GTGAATAATATGTTTTATCTCAAAATAGCCTGGCATAATTTAAAACATTCT 
ATAGACCAGTACATACCATTCCTCTTAGCCAGTTTATTACTTTATTCATTGA 
CTTGTTCTACGCTACTAATCTTAATGAGTGCTGTTGGAAGAGATATGGGGA 
CAGCGGCAACGGTTCTTTTTCTTGGAGTGATTGTTTTGTCAATCTTTGCGGT 
AGTCATGGAACATTATAGCTACAATATCTTGATGAAACAGCGTAGTAGTG 



wRu W <„n„ It f MmP if HI - li * >..!!■ *>„i» 1m.!> lint" >lm, 
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AATTTGGACTGTATAACATTTTGGGGATGAATAAACGTCAAGTTGCGCGTG 
TAGCTAGTCTAGAGCTGTTTATTATTTATATATTTCTTATTTCTATAGGAAG 
TCTGTTTAGTGCTTTTTTTGCTAAATTTATTTATTTAATTTTTGTCAACATTA 
TTAACTATCATGCACTAAATCTTAGTTTAAGTTTATGGCCATTTATTATTTG 
5 TATCGTTATATTTACAGGTATTTTTCTGACTTTAGAAGTTCCAGTTATTCGA 
CATGTTCATTTATCATCCCCATTAAGTCTTTTTAGAAAGAAACAACAGGGA 
GAAAAAGAACCAAAAGGTAATCTTATACTTGCAATTTTAGCGTTAGTAGCT 
ATCGCCATCGCTTATACAATGGCTCTTACTTCAGGTAAAGCACCTGCATTA 
GCTGTTATCTATCGTTTCTTCTTTGCAGTACTTTTAGTAATTGCTGGTACTT 

1 0 ATCTTTTTTATATTAGTTTTATGACATGGTACTTAAAAAGGTTGCGTCAAAA 
CAAGCATTATTATTATAAATCTGAGCATTTTGTATCAACTTCGCAAATGAT 
TTTTCGAATGAAGCAAAATGCAGTAGGGTTAGCAAGTATCACTTTATTAGC 
TGTTATGGCTCTAGTTACTATTGCTACAACAGTCTCACTCTATTCAAATACA 
CAAAATGTTGTTACCGGACTATTTCCAAAATCAGTAAGTTTATCAATAGAT 

1 5 AATTCAAAAGGTGACGCGAAAAATATATTTGAAGAAAAGATTTTGAAGAA 
ACTAGGTAAGTCATCTAAGGAAGCTATCACTTATAATCAGACAATGATTTC 
GATGCCAGTTAGTCAATCAAGTGACTTAATATCACATCTA 



20 

MNNMFYLKIAWHNLKHSIDQY1PFLLASLLLYSLTCSTLLILMSAVGRDMGTA 
ATVLFLGVIVLSIFAVVMEHYSYNILMKQRSSEFGLYNILGMNKRQVARVASL 
ELFIIYrFLISIGSLFSAFFAKFIYLIFVNIINYHALNLSLSLWPFIICIVIFTGIFLTLE 
VPVIRHVHLSSPLSLFRKKQQGEKEPKGNLILAILALVAIAIAYTMALTSGKAP 
25 ALAVIYRFFFAVLLVIAGTYLFYrSFMTWYLKRLRQNKHYYYKSEHFVSTSQM 
IFRMKQNAVGLASITLLAVMALVTIATTVSLYSNTQNVVTGLFPKSVSLSIDNS 
KGDAKNIFEEKILKKLGKSSKLEAITYNQTMISMPVSQSSDLISHL 



30 

Sequence description: 

A] Length: 1 1 19 bp - 373 aa (partial gene sequence) 
35 B] This gene sequence was not identified using the LEEP system. It was 

identified upstream of the ID-104 gene which was identified by LEEP, during 

cloning and sequence analysis of the full-length ID-104 gene sequence. 

Possible Shine Dalgarno sequence present 

upstream of a GTG start codon. Possesses a potential 
40 leader peptide sequence 



ID- 177 



Clone 2-5b (ID-1 12b) 

5 ATGGTTGAGCCAATTATTTCAATACAAGGACTTCATAAAAGTTTTGGGAAA 
AATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTGGT 
GGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAAT 
GAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTG 
ATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGC 

1 0 ATGGTTTTTCAAC AGTTCAATCTATTTCCCAATATG ACTGTACTAGA AAAT 
ATTACTTTATCACCTATTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAG 
ACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAA 
TGCTTATCCAGCAAGCTTATCTGGAGGACAACAACAACGGATTGCTATTGC 
AAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAACCTACTTCA 

1 5 GCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTA 
GCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCA 
CGTGAAGTAGCGGATCGTGTCATTtTTATGGATGCAGGGATTATTGTTGAG 
CAAGGGACCCCTAAGAAAGTATTTGAGCAGACAAAAGAAATCCGCACAAG 
AGACTTCTTAAGTAAAGTATTATAA 

20 

MVEPIISIQGLHKSFGKNEVLKGIDLDIHQGEVVVIIGPSGSGKSTFLRTMNLLE 
VPTKGTVTFEGIDITDfCKNDIFKMREKMGMVFQQFNLFPNMTVLENITLSPIKT 
KGLSKLDAQTKAYELLEKVGLKEKANAYPASLSGGQQQRIAIARGLAMNPDV 
25 LLFDEPTSALDPEMVGEVLTVMQDLAKSGMTMVIVTHEMGFAREVADRVIF 
MDAGIIVEQGTPKKVFEQTKEIRTRDFLSKVL* 



30 Sequence description: 

A] Length: 735 bp - 244 aa (full length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 12 gene which was identified by LEEP, 

35 during cloning and sequence analysis of the full-length ID-1 12 gene sequence. 

Shine-Dalgarno sequence precedes the 'ATG' 
start codon. No obvious leader peptide 

40 ID- 178 



Clone 2-5c (ID- 11 2c) 



ATGTCTCAsTATCAAGAGTGGTTAGAAAACGACTCACTCGGTAAAGATATT 
AAGTCAGATTTAGAAGCTATTAAAGGAGATGAATCTGAAATTCAGGATCG 
TTTTTACAAAACATTAGAATTTGGAACGGCGGGATTGAGAGGTAAACTTG 
GAGCAGGAACCAATCGTATGAATACTTATATGGTGGGGAAAGCAGCACAA 
5 GCATTAGCTAATCGATTATTGATCATGGCCCTGAAGCTATTGCACGTGGAA 
TTGCAGTTAGTTATGATGTCCCGTTATCAATCTAAGGAATTTGCAGAATTA 
ACTTGGTCCATTATGGCAGCAAATGGTATTAAAGCCTTATATTTA 

MSHMNYKE1YQEWLENDSLGKDIKSDLEA1KGDESEJQDRFYKTLEFGTAGLR 
10 GKLGAGTNRMTMTYMVGKAAQALANRLLTMALKLLHVELQLVMMSRYQSKE 
FAELTWSIMAANG[KALYL 



Sequence description: 

15 

A] Length: 366 bp - 122 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 12 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-1 12 gene sequence. 

20 Shine-Dalgamo sequence preceded the 'ATG' 

start codon. No obvious potential leader peptide sequence. 

ID- 179 

25 

Clone 2-5d (ID-1 1 2d) 

ATGCAACCTGTAAAAGTCGATGAACCTTCTGTTGAAGAAACCATTACTATT 
TTGAAAGGTATCCAAAAAAAATACGAAGATTATCATCACGTAAAATATAA 

30 TAATGATGCCATAGAAGCAGCTGCAGTACTATCTAATCGTTATATCCAAGA 
CCGCTTTTTACCTGATAAAGCAATAGACTTATTAGATGAAGCTGGTTCTAA 
AATGAACCTAACACTAAATTTTGTTGATCCAAAAGAAATTGATCAACGTCT 
CATTGAAGCAGAAAATTTAAAAGCGCAAGCGACTCGTGAAGAAGATTACG 
AACGTGCAGCTTACTTCCGTGACCAGATTGCAAAATATAAAGAAATGCAG 

3 5 C AAC AAAAGGTCG ACG ATC A AGATAC ACCT ATTATTACCGAAAAAAC AAT 
TGAGCACATCATTGAAGAAAAAACGAATATCCCTGTTGGTGATTTAAAAG 
AAAAAGAACAATCTCAATTAATTAATCTCGCAGATGACTTGAAACAGCAT 
GTGATCGGCCAGGATGACGCTGTCATTAAGATTGCAAAAGCTATTCGTCGT 
AATCGAGTTGGTCTTGGTAGCCCAAACCGTCCTATTGGTTCCTTTTTATTTG 

40 TAGGACCAACCGGTGTTGGTAAAACTGAACTTTCTAAACAACTAGCAATTG 
AGCTCTTTGGTTCAGCTGATAGTATGATTCGTTTTGATATGTCAGAGTACAT 
GGAAAAGCATGCTGTTGCTAAATTAGTCGGAGCGCCTCCAGGATACGTGG 
GATACGAGGAAGCTGGACAACTAACTGAAAAGGTTCGTCGAAATCCTTAC 
TCGCTCATCCTTCTAGATGAAATTGAAAAAGCTCATCCCGATGTCATGCAT 
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ATGTTCTTGCAGGTCCTTGATGACGGTCGATTAACAGATGGACAAGGAAG 
AACTGTTAGTTTTAAAGATACCATTATCATCATGACCTCAAATGCTGGTTC 
TGGTAAAACTGAAGCAAGTGTTGGCTTTGGTGCCTCACGAGAAGGTAGGA 
CGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCAT 
5 GCAAGC 

MQPVKVDEPSVEETITILKGIQKKYEDYHHVKYNNDArEAAAVLSNRYTQDRF 
LPDKAIDLLDEAGSKMNLTLNFVDPKEIDQRLIEAENLKAQATREEDYERAAY 
FRDQIAKYKEMQQQKVDDQDTPI1TEKTIEHIIEEKTNIPVGDLKEICEQSQL1NL 
1 0 ADDLKQHVIGQDDAVIKIAKAIRRNRVGLGSPNRPIGSFLFVGPTGVGKTELSK 
QLA1ELFGSADSMIRFDMSEYMEKHAVAKLVGAPPGYVGYEEAGQLTEKVRR 
NPYSLILLDEIEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDTIIIMTSNAGS 
GKTEASVGFGASREGRTNSSSVPGDPLESTCRHAS 

15 



Sequence description: 

20 

A] Length: 1070 bp y 356 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-1 12 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-1 12 gene sequence. Shine- 

25 Dalgarno sequence preceded the 'ATG' 

start codon. No obvious potential leader peptide 
sequence. 

ID- 180 

30 

Clone 2-7b (ID-1 13b) 



ATGAGAGGGAAGGTTATTTACGGCACAACCCTTATAGGTCTTTTTCTATTC 
3 5 TT ATTTTTCT ATTTTTGG ATTCCT AAGC ATC AC ATCG AG AG AAT AC ATC ATC 
ATCGTATAAAGCAGGTAGATGCGAAGAGTGATTTAACAGGATTTAAAACC 
CATTTGCCCATTATCAGCATTGATACAAAGCAACAAGTTATTCCTCTTGTT 
ACAAAAGAAGGCGGAAAATATGTCAAAGCTAGGGATAATATTAATGTTGA 
TATCGAATTACGGGATTCTCCAAGTAGATCACATCATTTATCAGAAAAGCC 
40 GAGAATTAGGACAAAAGGGTTAATATCATATAGAGGAAATTCCTCTCGTT 
ACTTTGATAAGAAGTCATTGAAAGTTAAGTTTGTTACTAATAAGTTAAAGG 
AAAAGAAGCATCGATTAGCAGGAATGCCTAAAGAATCGGAGTGGGTATTG 
CATGGTCCCTTTCTAGACAGAACATTATTAAGAAATTATCTGAGTTATAAT 



ATTGCTGGTGAGATTATGCCTATGCCCCAAACGTTCGCTACTGTGAGTTAT 
TTGTCAATGGTGAGTATCAGGGAG 



MRGKVIYGTTLIGLFLFLFFYFWIPKHH1ERIHHHRIKQVDAKSDLTGFKTHLPII 
5 SIDTI<CQQVLPLVTICEGGKYV1CARDN1>JVDIELRDSPSRSHHLSEKPR[RTKGL1S 
YRGNSSRYFDKLKSLKVKLFVTNKLPCEKKHRLAGMPKESEWVLHGPFLDRTLLR 
NYLSYNIAGEIMPMPQTFATVSYLSMVSIRE 



10 

Sequence description: 

A] Length: 582 bp - 194 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 

1 5 identified downstream of the ID-1 1 3 gene which was identified by LEEP, 

during cloning and sequence analysis of the full-length ID-1 13 gene sequence. 

ATG start codon is preceded by a Shine- 

Dalgamo sequence-Possesses a potential leader peptide 

sequence. C-terminus to be determined. 

20 



1D-181 

25 

Clone 2- 17b (ID- 11 7b) 



CTTCACATTTTATTGATCACTATCTGACAAATGTTAATCAAACAGCAGTTCT 
30 TATTTTAGTGGGATATTATTCAATGTATGTCTTGCAGACCTTAATTCAATAT 
TTTGGGAATCTCTTTTTTGCGCGTGTTTCTTATAGTATTGTTAGAGATATTC 
GTAGAGATGCTTTTGCTAATATGGAAAGGCTAGGCATGTCTTATTTTGATA 
GGACACCGGCAGGATCTATTGTGTCACGTATTACTAATGATACTGAAGCAA 
TATCTGATATGTTTTCGGGTATTTTATCAAGTTTTATCTCGGCGATATTTAT 
3 5 TTTT AC AGTTACTCTGTACACT ATGTTGATGCT AGAC ATTAAACTAAC AGG 
ACTCGTCGCTCTTTTGTTACCTGTTATCTTTATATTAGTGAATGTCTATCGG 
AAAAAATCAGTCACTGTCATTGCTAAAACGAGAAGTTTACTTAGTGATATC 
AACAGTAAATTATCAGAAAGTATTGAAGGAATTC 

40 SHFIDHYLTNVNQTAVLILVGYYSMYVLQTLIQYFGNLFFARVSYSIVRDIRRD 
AFANMERLGMSYFDRTPAGSIVSRITNDTEAISDMFSGILSSFISArFIFTVTLYT 
MLMLDIKLTGLVALLLPVIFILVNVYRKKSVTVIAKTRSLLSDINSKLSESIEGI 



84/98 



Sequence description: 

A] Length: 498 bp - 165 aa (Partial gene sequence) 
5 B] This gene sequence was not identified using the LEEP system. It was 

identified downstream of the ID-1 1 7 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-1 17 gene sequence. 
N- and C-termini have yet to be determined 

10 



ID-1 82 

15 



Clone 3-8b (ID- 1 20b) 



ATGTACCATATTGAATTAAAAAAGGAAGCTTTACTACCAAGAGAACGCCT 
20 AGTTGATTTAGGCGCAGATAGATTGAGTAATCAGGAGTTATTAGCCATTCT 
CTTACGTACAGGTATTAAAGAAAAACCTGTTCTTGAAATTTCAACGCAAAT 
TTTAGAAAACATAAGCAGTTTAGCAGATTTTGGTCAATTATCCTTACAGGA 
GTTGCAATCCATTAAAGGAATCGGTCAGGTTAAATCCGTCGAAATAAAAG 
CTATGCTAGAACTAGCAAAACGGATTCACAAAGCTGAATATGATCGTAAA 
25 GAGCAAATTTTAAGTAGTGAACAATTAGCGAGGAAAATGATGCTCGAATT 
AGGGGATAAAAAACAAGAACATTTAGTAGCTATTTATATGGATACACAAA 
ATCGTATTATCGAACAGAGAACTATTTTTATTGGTACTGTACGTCGTTCAG 
TAGCAGAGCCAAGAGAAATTCTACATTATGCTTGTAAAAACATGGCAACT 
TCTTTGATTATTATACATAATCATCCCTCAGGTTCTCCAAATCCCAGTGAAA 
3 0 GTG ATTT A AGTTTC ACT AAA AAAAT AA A ACG ATC ATGTG ATC ATCTGGGAA 
TTGTCTGCCTAGATCACATCATCGTTGGAAAAAATAAATATTATAGTTTTC 
GAGAAGAAGCAGATATTTTATAA 

MYHlELfCKEALLPRERLVDLGADRLSNQELLATLLRTGTKEKPVLETSTQTLENT 
35 SSLADFGQLSLQELQSIKGIGQVKSVEIKAMLELAKLRIHKAEYDRKEQILSSEQ 
LARKMMLELGDKKQEHLVAIYMDTQNRIIEQRTIFIGTVRRSVAEPREILHYAC 
KNMATSLIIIHNHPSGSPNPSESDLSFTKKIKRSCDHLGIVCLDHIIVGKNKYYSF 
REEADIL* 

40 Sequence description: 

A] Length: 681 bp - 227 aa (full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 120 gene which was identified by LEEP, 



> „ f? a a~~n it-* E«-£t tr^f.- ^~ r ~ -> .r- 
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during cloning and sequence analysis of the full-length ID- 120 gene sequence. 
ATG start codon is preceded by an typical 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence 



ID- 183 

10 



Clone 3-1 lb (ID-121b) 



TGGTTAAAAGTAGTGATAGCTTGTATTCCATCTATTTTAATTGCTTTACCAT 
1 5 TTGATAATTGGTTTGAAGCTCATTTTAATTTCATGATTCCGATTGCAATAGC 
CCTAATCTTTTATGGTTTTGTCTTCATATGGGTTGAAAAACGTAATGCACAC 
CTCAAACCACAGGTAACCGAATTGGCAAGTATGTCTTACAAGACAGCTTTC 
TTGATTGGATGTTTCCAGGTTCTCAGTATTGTTCCGGGAACCAGTCGTTCTG 
GAGCTACTATTTTAGGAGCAATTATTATTGGAACTAGTCGTTCGGTCGCTG 
20 CTGACTTTACTTTCTTCCTTGCCATCCCAACTATGTTTGGTTATAGTGGACT 
TAAGGCGGTTAAATATTTTTTAGATGGTAACGTCTTGAGTTTAGACCAATC 
TTTAATACTTTTAGTAGCAAGTCTGACAGCTTTCGTAGTTAGTTTATATGTT 
ATTCGTTTCTTGACAGACTATGTCAAACGACACGATTTCACCATCTTTGGT 
AAGTATCGTATAGTCTTAGGAAGTTTACTCATCCTCTACTGGTTAGTTGTTC 
25 ATTTATTCTAA 



WLKVVIAC1PSILIALPFDNWFEAHFNFMIPIAIALIFYGFVFIWVEKRNAHLKP 
QVTELASMSYKTAFLIGCFQVLSIVPGTSRSGATILGAIIIGTSRSVAADFTFFLA 
30 IPTMFGYSGLKAVKYFLDGNVLSLDQSLILLVASLTAFVVSLYVIRFLTDYVKR 
HDFTIFGKYRIVLGSLLILYWLVVHLF* 

Sequence description: 

35 A] Length: 579 bp - 193 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-68 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-68 gene sequence 
described in WO 00/06736. N-terminus has yet to be determined. 

40 

ID-184 



Clone 3-llc(ID-121c) 



■•7: ti*-%:V~£ ;r~ir "It K™-! ^"7r c~ g *- u r* .-r^-r ..^ .pft 
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ATGGAAATGAAACAAATCAGTGAAACAACACTGAAAATTACAATTAGTAT 
GGAAGATTTAGAAGATCGTGGTATGGAGCTGAAAGATTTCCTAATCCCTCA 
5 GGAGAAGACTGAGGAATTTTTCTATTCTGTCATGGATGAATTAGACTTGCC 
AGAAAACTTTAAAAATAGTGGTATGTTAAGTTTTCGAGTAACACCTAAAA 
AAGATCGCATTGATGTTTTTGTTACAAAGTCTGAATTAAGTAAAGATTTAA 
ATTTAGAAGAATTAGCAGATTTGGGTGACATTTCAAAAATGTCTCCAGAAG 
ACTTTTTTAAAACCTTGGAACAATCGATGTTGGAAAAAGGGGATACGGAT 
1 0 GCCCATGCC AAATT AGC AGAAATTGAAA ATATGATGGATA AAGCAACTCA 
AGAAGTAGTTGAGGAAAATGTTTCTGAAGAACAACCTGAAAAGGAAGTAG 
AAACGATTGGATATGTTCACTATGTCTTTGATTTTGATAATATTGAAGCTGT 
AGTTCGATTTTCACAAACGATTGATTTTCCAATAGAAGCTT 



1 5 MEMKQISETTLKITISMEDLEDRGMELKDFLIPQEKTEEFFYSVMDELDLPENF 
KNSGMLSFRVTPKKDRIDVFVTKSELSKDLNLEELADLGDISKMSPEDFFKTLE 
QSMLEKGDTDAHAKLAEIENMMDKATQEVVEENVSEEQPEKEVETIGYVHY 
VFDFDNIEAVVRFSQTIDFPIEA 



20 

Sequence description: 



A] Length: 547 bp - 182 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
25 identified downstream of the 1D-68 gene which was identified by LEEP, 

during cloning and sequence analysis of the full-length ID-68 gene sequence. 
ATG start codon is preceded by an typical 
Shine-Dalgarno sequence. No obvious potential 
leader peptide 
30 sequence 



ID- 185 

35 



Clone 3-16b (ID-122b) 



40 GGAAACCAACGGCCAGTACAATCGTCAAGGGTAGATTATCCTAAACGTAG 
TCGTGCCAAGATTGTAGAAGTTTATTTTAGACAAGCTTCTACTACTGATTA 
TTCTGGTGTTTACAAAGGTTACTATATTGACTTTGAAGCCAAAGAAACCCG 
GCAGAAAACTGCTATGCCTATGAAAAATTTTCATGCTCACCAAATAGAGC 
ACATGGCAAATGTATTACAGCAAAAAGGGATTTGCTTTGTCTTGCTTCATT 



gy^O/g UiiiP 1.ul> ,,!<■ „,«„ 7, ,5 1.,t> d? «! .i..*' M„5 ti^'i •(, i[„ 



TTTCCACACTTAAGGAAACCTATCTACTCCCTGCTAATGAGTTAATTTCATT 
TTATCAGATTGATAAAGGCAATAAATCAATGCCTATTGATTATATCAGAAA 
AAATGGATTTTTCGTAAAGGAGAGTGCCTTTCCTCAAGTCCCTTACTTAGA 
TATTATTGAAGAAAAATTATTAGGCGGTGATTACAATTAA 

5 

GNQRPVQSSRVDYPKRSRAK1VEVYFRQASTTDYSGVYKGYYIDFEAKETRQ 
KTAMPMKNFHAHQIEHMANVLQQKGrCFVLLHFSTLKETYLLPANELrSFYQl 
DKGNKSMPLDYrRKNGFFVKESAFPQVPYLDLlEEKLLGGDYN* 

10 

Sequence description: 

A] Length: 447 bp - 149 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 

15 identified upstream of the ID-122 gene which was identified by LEEP, during 

cloning and sequence analysis of the full-length ID-122 gene sequence. N- 
terminus has yet to be determined 



20 

ID-186 



Clone 3-1 7b (ID-123b) 

25 

GGATCCTAAAAACGCTAAGGTTTATCAAAAAAATGCTGATCAATTTAGTG 

ACAAGGCAATGGCTATTGCAGAGAAGTATAAGCCAAAATTTAAAGCTGCA 

AAGTCTAAATACTTTGTGACTTCACATACAGCATTCTCATACTTAGCTAAG 

30 CGATACGGATTGACTCAGTTAGGTATTGCAGGTGTCTCAACCGAGCAAGA 
ACCTAGTGCTAAAAAATTAGCCGAAATTCAGGAGTTTGTGAAAACATATA 
AGGTTAAGACTATTTTTGTTGAAGAAGGAGTCTCACCTAAATTAGCTCAAG 
CAGTAGCTTCAGCTACTCGAGTTAAAATTGCAAGTTTAAGTCCTTTAGAAG 
CAGTTCCCAAAAACAATAAAGATTACTTAGAAAATTTGGAAACTAATCTTA 

35 AGGTACTTGTCAAATCGTTAAATCAATAG 



DPKNAKVYQKNADQFSDKAMAIAEKYKPKFKAAKSKYFVTSHTAFSYLARR 
YGLTQLGIAGVSTEQEPSAKLKLAEIQEFVKTYKVKTIFVEEGVSPKLAQAVAS 
40 ATRVKIASLSPLEAVPKNNKDYLENLETNLKVLVKSLNQ* 



Sequence description: 
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A] Length: 433 bp - 144 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 123 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 123 gene sequence. 
N-terminus has yet to be determined 



ID- 187 



Clone 3-46/47 (ID- 130b) 



ATGAAAAAAGTCATCGATTTAAAAAAACTACAAAAAGCATACGCCTCAGA 

AACTGTTTTAAATAATATTAATTTGGAGGTGTTTAAAGGAGAAATAATTGG 

ATTAATAGGACCCTCTGGAGCAGGGAAATCTACCTTGATTAAAACTATGCT 

TGGCATGGAAAAAGCAGATAAGGGAACAGCTCTTGTTCTTGATACTCAAA 

TGCCAGATCGTAATATTTTAAATCAAATTGGCTATATGGCTCAATCTGATG 

CCTTACACGAGTCTTTAACTGGCTTAGAAAATTTATTATTCTTTGGAAAAA 

TGAAAGGTATTCAAAAAACTGAATTAAAACAGCAGATAACTCATATTTCT 

AAAGTAGTAGATCTAGAAAACCAACTTGATAAATTTGTCTCAGGTTACTCA 

GAAGGTATGAAAAGACGGCTTTCTCTAGCCATCGCCCTACTTGGAAACCCC 

ACAGTTTTAATCCTAGATGAACCTACCGTTGGAATTGATCCATCCTTGAGG 

AGAAAAATCTGGCAAGAGCTAATTAATATTAAGGATGAAGGACGTTCTAT 

CTTTATTACAACCCACGTTATGGATGAAGCAGAATTAACAAGTAAGGTTGC 

ACTACTATTACGTGGAAACATTATTGCCTTTGATACTCCATTACATTTAAA 

AAAACAATTTAATGTGAGTACTATTGAGGAAGTTTTCTTAAAAGCTGAAGG 

AGAATAA 

MKKV1DLKLK1LQKAYASETVLNNINLEVFKGEIIGLIGPSGAGKSTLIKTMLGME 

KADKGTALVLDTQMPDRNILNQIGYMAQSDALHESLTGLENLLFFGKMKGIQ 

KTELKQQITHISKVVDLENQLDKFVSGYSEGMKRRLSLAIALLGNPTVLILDEP 

TVGIDPSLR.RKIWQELINIKDEGRSIFITTHVMDEAELTSKVALLLRGNIIAFDTP 

LHLKKQFNVSTIEEVFLKAEGE* 



Sequence description: 

A] Length: 717 bp - 239 aa (Possible full-length sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-130 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-130 gene sequence. ATG 
start codon is preceded by a possible 
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Shine-Dalgarno. No obvious potential leader 
peptide sequence 

5 

ID- 188 



Clone 3-83b (ID-144b) 

10 

ATGGTACAAATGATACATGATATGATTAAAACAATTGAGCATTTTGCTGAG 
ACACAAGCTGATTTTCCAGTGTATGATATTTTAGGGGAAGTCCATACTTAT 
GGACAACTTAAAGTAGACTCTGACTCTCTAGCTGCTCATATTGATAGCCTA 
GGCCTTGTTGAAAAATCACCTGTCTTAGTATTCGGTGGTCAAGAATATGAA 

1 5 ATGTTGGCGACATTTGTTGCTTTAAC AAAGTC AGGGC ATGCTTATATACCG 
GTTGACCAACACTCTGCTTTGGATAGAATACAGGCTATTATGACAGTTGCT 
CAACCAAGCCTTATCATTTCAATTGGTGAATTTCCTCTTGAAGTTGATAAT 
GTCCCAATCCTAGACGTTTCTCAAGTTTCAGCTATTTTTGAAGAAAAGACT 
CCTTATGAGGTAACACATTCTGTTAAAGGTGATGATAATTACTATATTATT 

20 TTCACTTCAGGGACTACTGGTTTACCAAAAGGTGTGCAAATTTCACATGAC 
AATTTATTGAGCTTTACAAATTGGATGATTTCTGATGATGAGTTTTCAGTTC 
CTGAAAGACCGCAAATGTTGGCTCAACCC 

MVQMIHDMIKTIEHFAETQADFPVYDILGEVHTYGQLKVDSDSLAAHIDSLGL 
25 VEKSPVLVFGGQEYEMLATFVALTKSGHAYIPVDQHSALDRIQAIMTVAQPSL 
IISIGEFPLEVDNVPILDVSQVSAIFEEKTPYEVTHSVKGDDNYYIIFTSGTTGLP 
KGVQISHDNLLSFTNWMISDDEFSVPERPQMLAQP 



30 Sequence description: 



A] Length: 592 bp - 197 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
35 identified downstream of the ID-144 gene which was identified by LEEP, 

during cloning and sequence analysis of the full-length ID-144 gene sequence. 
Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No obvious 
leader peptide sequence 
40 This orf is not in frame with nuc 



ID- 189 
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Clone 3-86b (ID-145b) 

5 

ATGGAAAATCATCGTTATGAAGATGAAGGTAAATTCCAGCGTAAGATGAC 
CAGTCGTCATCTCTTTATGTTATCGCTAGGTGGTGTTATCGGGACTGGGCTT 
TTCTTGAGTTCAGGTTATACCATTGCACAGGCTGGTCCGCTTGGAGCTGTG 
CTGTCTTATTTGATTGGTGCCGTTGTGGTTTATTTGGTCATGCTATCACTTG 

1 0 GGGAATTGGCGGTTGCCATGCCGGTGACGGGGTCATTCC ACACTTATGCCA 
CTAAGTTTATCAGTCCTGGAACAGGTTTTACTGTTGCTTGGCTATATTGGAT 
TTGTTGGACGGTCGCCTTGGGGACTGAATTTTTAGGTGCTGCCATGCTGAT 
GCAGCGCTGGTTCCCAAATGTGCCGGCTTGGGCATTTGCTTCCTTTTTTGCC 
CTTGTGATTTTTGGTTTAAATGCTCTTAGCGTACGCTTTTTTGCAGAAGCAG 

1 5 AGTCTTTCTTCTC AAGTATTAAGGTTATTGCTATCATTATCTTTATTATCTTG 
GGCTTAGGTGCTATGTTTGGTCTAGTTTCCTTTGAAGGTCAGCACAAGGCT 
ATTCTCTTCACTCATCTGACTGCCAATGGTGCCTTTCCAAATGGTATCGTTG 
CAGTTGTCTCAGTCATGTTGGCTGTTAACTATGCCTTCTCTGGTACTGAGTT 
AATTGGTATTGCGGCTGGTGAAACGGATAATCCCAAAGAAGCTGTACCAA 

20 GGGCTATTAAAACGACAATCGGTCGCTTGGTTGTTTTCTTTGTACTGACAA 
TTGTTGTCCTAGCTTCGCTATTGCCAATGAAAGAGGCAGGCGTATCCACAG 
CACCATTCGTTGATGTCTTTGACAAGATGGGAATCCCTTTTACGGCGGATA 
TCATGAACTTCGTTATCTTGACAGCCATCCTGTCTGCTGGTAACTCAGGTCT 
CTACGCATCAAGCCGTATGCTCTGGTCCCTTGCCAATGAAGGTATGTTGTC 

25 AAAATCTGTTGTGAAAATCAATAAACACGGTGTCCCAATGCGTGCTCTTCT 
CTTGTCAATGGCAGGAGCAGTGCTGTCGCTCTTTTCAAGTATTTACGCTGC 
AGACACAGTTTATCTAGCCTTGGTTTCAATCGCGGGCTTTGCTGTTGTTGTC 
GTATGGCTAGCCATTCCAGTCGCACAAATCAATTTCCGCAAGGAATTC 

30 MENHRYEDEGKFQRKMTSRHLFMLSLGGVIGTGLFLSSGYTIAQAGPLGAVL 
SYLIGAVVVYLVMLSLGELAVAMPVTGSFHTYATKFISPGTGFTVAWLYWIC 
WTVALGTEFLGAAMLMQRWFPNVPAWAFASFFALVIFGLNALSVRFFAEAES 
FFSSIKVIAIIIFIILGLGAMFGLVSFEGQHKAILFTHLTANGAFPNGIVAVVSVM 
LAVNYAFSGTELIGIAAGETDNPKEAVPRAIKTTIGRLVVFFVLTIVVLASLLPM 

35 K.EAGVSTAPFVDVFDKMG1PFTADIMNFVILTAILSAGNSGLYASSRMLWSLA 
NEGMLSKSVVKINKHGVPMRALLLSMAGAVLSLFSSIYAADTVYLALVSIAGF 
AVVVVWLAIPVAQ1NFRKEF 



40 

Sequence description: 



A] Length: 1 126 bp - 393 aa (partial gene 
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sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-145 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-145 gene sequence. 
Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possesses a 
possible leader peptide sequence. 



ID- 190 



Clone 3-94b 



TCAGAAAATGCAGAGGCAGCAACGGTTGCCACAAACTTGGTTACCAAAGG 

AGCTAATGTCATTATCGGACCAGCAACATCGGGTGCAGCTGCATCTTCAAC 

TCCAAAAGTAAATGCAGCAGCAGTTCCAATGATTGCACCTGCTGCGACAC 

AAGACAATTTAGTCTATGGTTCTGATGGAAAAACCTTAAATCAGTATTTCT 

TCCGAGCTACTTTTGTCGATAATTATCAAGGAAAGCTATTGTCTCAGTATG 

CTACAGACAACCTTAAAGCTAAAAAAGTTGTTCTATTTTATGATAATTCAT 

CAGATTACTCAAAGGGGGTAGCAAAATCATTTAAGGAAAGTTATAGTGGA 

AAAATTGTTGATAGTATGACATTCTCCGCTGGTGATACTGATTTCCAAGCG 

TCATTGACTAAGTTGAAAGGGAAAGAATATGATGCTATTGTGATGCCAGG 

TTACTATACCGAGACAGGATTAATAGTTAAGCAAGCGCGTGATTTAGGTAT 

CTCTAAACCGGTTCTTGGGCCTGATGGTTTTGATAGTCCGAAATTTGTGCA 

ATCGGCAACACCTGTAGGAGCTTCAAACGTTTATTATTTGACAGGTTTCAC 

TACACAAGGATCAACCAAAGCTAAAGCT 

SENAEAATVATNLVTKGANVIIGPATSGAAASSTPKVNAAAVPM1APAATQD 

NLVYGSDGKTLNQYFFRATFVDNYQGKLLSQYATDNLKAKLKVVLFYDNSSD 

YSKGVAKSFKESYSGKIVDSMTFSAGDTDFQASLTKLKGKEYDAIVMPGYYT 

ETGLIVKQARDLGISKPVLGPDGFDSPKFVQSATPVGASNVYYLTGFTTQGST 

KAKA 



Sequence description 



A] Length: 637 bp - 231 aa (partial sequence) 
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B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 149 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length 1D-149 gene sequence. N- and 
C-termini have yet to be determined 



TD-191 

10 



Clone 2-c94b (ID-153b) 



TTGGGACTTAAAGACCATGCTTTAGTCTATCCATTTTCATTATCTGGGGGG 
1 5 C AAAAGCAACGTGTCGC ACTAGCTCGTGCGATGATGATTGATCC ACAGATT 
ATTGGTTATGATGAGCCAACTAGCGCTCTTGATCCAGAGTTGCGTCAAGAA 
GTAGAAAAACTAATTTTACAAAATAGAGAAACAGGTATGACACAAATTGT 
AGTAACACATGATCTTCAATTTGCTGAAAGTATATCTGATACGATTCTCAA 
AATTAATCCTAAGTAG 

20 

MGLKDHALVYPFSLSGGQKQRVALARAMMIDPQIIGYDEPTSALDPELRQEV 
EKLILQNRETGMTQIVVTHDLQFAESISDTILKINPK* 



25 



Sequence description 



A] Length: 270 bp - 90 aa (partial sequence) 
30 B] This gene sequence was not identified using the LEEP system. It was 

identified upstream of the ID-153 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID-153 gene sequence. 
N-terminus has yet to be determined 



35 



40 



ID-192 

Clone 2-clb(lD-155b) 



ATGACTAATATCTCAGATGTTCCAAAAGCTATTAGAACACAGGCACAGTAT 
GTTCTCTTGGGAATGAGAGTTATGGATCAGTCGGTATTACCGAAAACATAT 
AATTCAAAAGAACCTTATTTGAAACCAGATATGATTTATATTCATGATAGA 
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AGACAAGAGACAATGCTTAAAATCACTCAAGAAATAGAAATGGAGCATTG 
A 

MTNISDVPKLAIRTQAQYVLLGMRVMDQSVLPKTYNSKEPYLKLPDMIYIHDRR 
5 QETMLKITQEIEMEH* 



Sequence description 



A] Length: 204 bp - 68 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 155 gene which was identified by LEEP, during 

15 cloning and sequence analysis of the ID- 155 gene sequence. 

ATG start codon is preceded by a potential typical Shine-Dalgarno sequence. 
Has a 

typical leader peptide. N-terminus has yet to be 
verified 



iD-193 

25 Clone 2-54altb (ID-1 72b) 

AAGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCTTGGGGAATATAAATT 
TGGATTTCATGACGATGTAAAGCCAATTTATTCTACGGGAAAAGGTCTAAA 
TGAGGCTGTTATTCGTGAGTTATCTGCAGCTAAGGGTGAACCTGAGTGGAT 

30 GTTGGACTTTCGTCTAAAATCCTTGGAAACGTTTAATAAAATGCCGATGCA 
GACCTGGGGAGCAGATTTATCAGATATTGATTTTGATGATATTATTTATTA 
TCAAAAAGCATCTGATAAACCTGCGCGTGATTGGGATGATGTTCCAGAAA 
AAATCAAAGAAACTTTTGAAAGAATTGGGATTCCAGAAGCTGAAAGAGCC 
TATCTTGCAGGAGCATCAGCACAATATGAATCAGAAGTAGTTTATCACAAT 

3 5 ATG AAAG AAG A AT ATG AT AAGCTGGGT ATTGTTTTTACGG AT ACTG ACTCT 
GCACTTAAAGAGTACCCAGAGCTATTCAAAAAATATTTTGCTAAACTTGTC 
CCTCCAACAGATAATAAATTAGCTGCTCTGAACTCTGCTGTATGGTCAGGT 
GGAACATTTATTTATGTTCCTAAAGGTGTTAAGGTGGATATTCCACTTCAA 
ACTTACTTCCGTATTAATAATGAAAATACTGGACAATTTGAACGTACTCTC 

40 ATTATTGTTGATGAGGGAGCAAGTGTTCACTATGTTGAAGGTTGTACCGCC 
CCAACTTATTCTTCAAATAGTTTACATGCAGCTATAGTTGAAATTTTTGCAC 
TTGATGGAGCTTATATGCGCTATACGACTATTCAAAATTGGTCCGATAATG 
TCTATAATTTAGTGACAAAACGTGCTACCGCTAAAAAAGATGCAACAGTT 
GAGTGGATAGATGGAAATCTAGGAGCTAAAACAACAATGAAATACCCATC 
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GGTTTACCTTGATGGTGAAGGAGCACGTGGCACGATGTTGTCTATTGCTTT 

TGCAAACAAAGGACAACACCAAGATACGGGTGCAAAGATGATTCATAATG 

CCCCCCATACTAGTTCATCCATTGTCTCTAAATCAATTGCTAAGGGTGGGG 

GAAAAGTTGATTATCGAGGTCAAGTGACATTTAATAAAGATTCCAAAAAA 

TCAGTGTCACATATAGAATGTGACACCATATTGATGGATGATATTTCAAAA 

TCAGATACCATACCGTTTAATGAGATTCATAATTCACAGGTTGCTTTAGAG 

CATGAAGCAAAGGTGTCTAAGATTTCTGAAGAGCAACTGTACTACTTGATG 

AGTCGAGGTTTATCTGAAGCTGAAGCAACAGAAATGATTGTTATGGGGTTT 

GTTGAGCCCTTTACGAAAGAATTACCAATGGAATATGCGGTAGAGTTAAA 

TCGTTTAATTTCCTATGAAATGGAAGGTTCAGTTGGTTAA 



MHACRSTLEDLGEYKFGFHDDVKPIYSTGKGLNEAVIRELSAAKGEPEWMLD 

FRLKSLETFNKMPMQTWGADLSD1DFDD1IYYQKASDKPARJDWDDVPEKJKE 

TFERIGIPEAERAYLAGASAQYESEVVYHNMKEEYDKLGIVFTDTDSALKEYP 

ELFKKYFAKLVPPTDNKLAALNSAVWSGGTFIYVPKGVKVDIPLQTYFRINNE 

NTGQFERTLIIVDEGASVHYVEGCTAPTYSSNSLHAAIVEIFALDGAYMRYTTI 

QNWSDNVYNLVTKRATAKKDATVEWIDGNLGAKTTMKYPSVYLDGEGARG 

TMLSIAFANKGQHQDTGAKMIHNAPHTSSSIVSKSIAKGGGKVDYRGQVTFN 

KDSKKSVSHIECDTILMDDISKSDTIPFNEIHNSQVALEHEAKVSKISEEQLYYL 

MSRGLSEAEATEMIVMGFVEPFTKELPMEYAVELNRLISYEMEGSVG* 



Sequence description: 

A] Length: 141 1 bp - 469 aa (Possible full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-72 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-72 gene sequence. 
No obvious Shine Dalgarno sequence upstream of 

TTG start codon insufficient sequence data). N 
terminus needs verification. 



ID- 194 

Clone 3-lb (ID-81b) 

ATGATAGAATTCTTTTCTAATATCAGAACAGAGATTCCGCAGATGCCTTTA 
CTTATCCATAGTTTGATTTTATCTGTCTTACCTTTTCTGATGTGGCTGACTTT 
GGTTAATAGAGATAAGCCTTTGTATAAAACTATTTGGAGTATCCTTTTAGG 
ACTTCAGTTAATTACGATTTATACTTGGTTTTTCTGGGCAAAATTGCCTTTA 
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TCTGAAAGTCTTCCCCTTTACCATTGTCGAATAGGCATGTTTGTCGGTCTCT 
TA 

MIEFFSNIRTELPQMPLLIHSLILSVLPFLMWLTLVNRDKPLYKTIWSILLGLQLI 
5 TIYTWFFW AKLPLS ES LPLYHCRIGMF VGLL 



Sequence description 

A) Length: 261 bp - 87 aa (partial gene sequence) 
10 B) This gene sequence was not identified using the LEEP system. It was identified 

downstream of the ID-81 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-81 gene sequence. Sequence Characteristics: 
Possesses a potential leader peptide sequenceOrf is preceded by a potential Shine- 
Dalgarno sequence. 



ID- 195 

20 

Clone RS-55b 



AAGCTTGTGCAAAGTATTAAAGAGATAGGATTAGCTAATGCGCATTTATTA 
25 GCTGTTGCTCCGACAGGGTCAATCAGTTATCTTTCTTCTTGTACTCCGAGCC 
TTCAACCGGTTGTATCACCTGTCGAAGTACGCAAGGAAGGAGCACTGGGG 
AGGGTTTATGTAGCTGCTTATAAGATTGATGCAGATAATTATGTCTACTAC 
AAAAAAGGAGCTTATGAAGTGGGATCTGAGGCGATTATCAATATTGCAGC 
TGCCGCTCAAAAACACATTGATCAAGCTATTTCGTTAACGCTTTTCATGAC 
30 AGATCAAGCAACTACGCGAGATTTAAATAAAGCCTATATTCAAGCATTTA 
AACAAAAATGTGCCTCTATTTATTATGTACGAGTGAGACAGGACATCCTAG 
AAGGTAGCGAGAGTTATGATGATATGCTGGATGATTTCACTTCATCGGACT 
TAGAAGACTGTCAATCCTGCATGATTTAA 

35 

>KLVQSIKEIGLANAHLLAVAPTGSISYLSSCTPSLQPVVSPVEVRKEGALGRV 
YVAAYKIDADNYVYYKKGAYEVGSEAIINIAAAAQKHIDQAISLTLFMTDQAT 
TRDLNKAYIQAFKQKCASIYYVRVRQDILEGSESYDDMLDDFTSSDLEDCQSC 
MI* 

40 



Sequence description: 
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A] Length 486 bp - 162 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was identified 
upstream of the ID-87 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-87 gene sequence. N-terminus to be 

5 determined. 



ID- 196 

10 

Clone RS-59(ID-90b) 



GTGAGGACATATATTACAAACTTGAATGGACATTCAATCACTAGTACAGC 
1 5 ACAAATAGCTCAAAACATGGTAAC AGATATAGCAGTAAGCTTAGGTTTTC 
GTGAGCTGGGAATACATTCTTATCCGATTGATACTGATTCTCCTGAGGAAA 
TGAGTAAGCGTTTAGATGGAATCTGTTCCGGACTTAGAAAAAATGATATTG 
TCATATTTCAGACACCTACATGGAACACTACAACTTTTGATGAAAAATTAT 
TTCACAAATTAAAAATATTTGGTGTAAAGATTGTTATTTTTATACATGATGT 
20 TGTACCGCTAATGTTTGATGGAAATTTTTATTTGATGGATAGAACTATAGC 
TTATTATAATGAAGCAGATGTTTAATAGCCCCTAGTCAAGCAATGGTCGAT 
AAGCTT 

MRTYITNLNGHSITSTAQIAQNMVTDIAVSLGFRELGfflSYPIDTDSPEEMSKRL 
25 DGICSGLRKNDIVIFQTPTWNTTTFDEKLFHKLKIFGVK1VIFEHDVVPLMFDGN 
FYLMDRTIAYYNEADVLIAPSQAMVDKL 



Sequence description: 

30 

A] Length: 414 bp - 138 aa(partial gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-90 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-90 gene sequence. 

35 No obvious signal peptide, but a 

possible Shine Dalgarno sequence is present 
upstream of ATG start codon. C-terminus has yet 
to be determined. 

40 

ID- 197 



Clone RS-59c (ID-90c) 
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CATGGAAATGAAGTTGATGATGTTATTAGAAGGGCATTTGAATATAATCAC 
CTTATCTTTGCTTTTGATAATACCTGTCATAACAGAGAGTTAGTATTAGATA 
5 GCAATATCATTTCTCACACAACCTGTGAACAATTGATAAATTTAATGAAAA 
ATTTATCAGGCTCCATTATGTATTTGCTAGAGCAACAAAGAGAACAAACA 
AGTAATGAAACAAAAGAGCGTTATAAAGAAATATTAGGAGGGTATGGAA 
ATGCCTAA 

1 0 HGNEVDDVIRRAFEYNHLIFAFDNTCHNRELVLDSNriSHTTCEQLTNLMKNLS 
GSIMYLLEQQREQTSNETKERYKEILGGYGNA* 



15 Sequence description: 

A] Length: 261 bp - 87 aa(partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-90 gene which was identified by LEEP, during 

20 cloning and sequence analysis of the full-length ID-90 gene sequence. N- 

terminus has yet to be determined 



25 



40 



ID- 198 



Clone RS-70b (ID-93b) 



ACATTTTTATATTATGTATTTGAAGACGTAGCCACCCAGTCAAATATGACT 
30 GGGAAGATTTTTAGTATGTCTAAAGAAGAGTTGTCATATTTACCCGTTATT 
AAACTTTTTAAGAATCAAGGTGTATACAACGGCTTGATTGGTCTATTCCTC 
CTTTATGGGTTATATATTTCACAGAATCAAGAAATTGTAGCTATTTTTTTAA 
TCAATGTGTTGCTAGTTGCTGTTTATGGTGCTTTGACAGTTGATAAAAAAA 
TCTTATTAAAACAGGGTGGTTTACCTATATTAGCTCTTTTAACATTCTTATT 
35 TTAA 

TFLYYVFEDVATQSNMTGKIFSMSKEELSYLPVIKLFKNQGVYNGLIGLFLLY 
GLYISQNQEIVAIFL1TMVLLVAVYGALTVDKKILLKQGGLPILALLTFLF* 



Sequence description: 

A] Length: 312 bp - 104 aa (partial gene sequence) 
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B] This gene sequence was not identified using the LEEP system. It 
was identified upstream of the ID-93 gene which was identified by 
LEEP, during cloning and sequence analysis of the full-length ID-93 
gene sequence. 
5 N-terminus has yet to be determined 



1D-199 

10 Clone RS-70c (ID-93c) 

ATGAAATTAAGTGTCCTTGATTATGGGCTTATTGATTATGGAAAAACTGCA 
AGTGATGCAATACAAGAAACGATTCTTTTATCACAAGAGGCGGAGCAACT 
AGGCTATCATCAATTTTGGGTGGCTGAACATCACGGTGTTAAGGCATTCAG 

1 5 TATTAGCAATCCAGAATTAATGATAATGCATTTGGCTAACCAGACTAAATC 
TATCAAAATTGGCTCTGGAGGTATAATGCCTCTGCACTATAGTAGTTTTAA 
ACTCGCGGAGACTCTCAAGACATTAGAGACATGTCATCCGAATCGAGTAA 
GTATTGGTTTAGGAAATTCACTAGGGACAGTTAAAGTTTCAAATGCACTTC 
GTAGCTTACATAAAGCACATGATTACGAAGAGGTACTGGAGGAATTGAAG 

20 TCATGGCTTATTGATGAATCATCCAGTAAGGAACCATTAGTTCAACCGACT 
CTTTCTAGCTTCCCAGACTTATATGTGTTGGGGAGTGGTCAAAAATCAGCT 
TATTTAGCGGCTAAACTTGGCTTAGGCTTTACCTTCGGTGTTTTTCCTTTTA 
TGGACAAAGACCCATTGACAGAAGCTAAA 

25 MKLSVLDYGLTDYGKTASDAIQETILLSQEAEQLGYHQFWVAEHHGVKAFSrS 
NPELMIMHLANQTKSIKIGSGGIMPLHYSSFKLAETLKTLETCHPNRVSIGLGN 
SLGTVKVSNALRSLHKAHDYEEVLEELKSWLIDESSSKEPLVQPTLSSFPDLYV 
LGSGQKSAYLAAKLGLGFTFGVFPFMDKDPLTEAK 

30 

Sequence description: 

A] Length: 588 bp - 196 aa (partial) 

B] This gene sequence was not identified using the LEEP system. It was identified 
35 downstream of the ID-93 gene which was identified by LEEP, during cloning and 

sequence analysis of the full-length ID-93 gene sequence. No obvious signal peptide, 
but Shine Dalgarno sequence upstream of the ATG start codon. 



